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SOME PRELIMINARY POSTULATES ON STIMULUS- 
PERCEPTION, AND CONSEQUENT PROBABILITY 
DISTRIBUTIONS FOR BEHAVIOR* 


ROBERT DEAN GORDON 
Scripps Institution of Oceanography 


A stimulus (or stimulus-complex) is pictured as giving rise to a 
random series of sensory nerve “pulses,” which manifest themselves 
in contractions of individual muscle fibers. Assuming the expected 
time-frequency of these pulses to be proportional to the intensity of 
the stimulus, probability distributions are computed representing the 
cumulative effect of these pulses on the state of the organism, that 
is, on its degree of awareness of the stimulus. Preliminary results 
suggest a modification of the Weber-Fechner formula for intensity 
discrimination for certain types of stimuli: the psychological scale to 
be measured by J? instead of log I. 


The immediate objective of the computations which follow is to 
arrive at a basis for analyzing the schooling behavior of sardines in 
the ocean after a fashion similar to the statistical kinetic theory 
approach to gas laws. It may not be unreasonable to hope, however, 
that very similar considerations might eventually prove of some value 
in dealing with human social phenomena. 

Such analysis as the following, if successful, should be helpful in 
interpreting annual fishing records and in framing intelligent conser- 
vation measures. There is good evidence to indicate that the schooling 
behavior of sardines and similar species is due entirely to certain 
simple responses to visual stimulation (J, 2); and since therefore 
the behavior of the stimulating agent (light) is itself fairly well under- 
stood, it does not appear entirely unreasonable to hope that some fair- 
ly dependable conclusions might be drawn from such an approach. 


1. General picture of processes. A sensory organism makes a 
response or adjustment to a given type of stimulus on condition that 
the intensity and duration of the stimulus are sufficiently great. This 
statement is numerically formulated in Weiss’ “law” which sets down 


It=a+bt (1) 


as a necessary condition of response, where / is the intensity of the 
stimulus (measured presumably on an additive scale), ¢ is the dura- 
tion time, and a and b are constants (3). This clearly implies the two 


‘ * Contributions from the Scripps Institution of Oceanography, New Series, 
o. 54 
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inequalities Jt > a, I > b, so that a and b both play the part of 
thresholds. 

In the following pages a stimulus is pictured as giving rise to a 
series of “bolts” or pulses originating in the set of sensory nerve- 
endings affected by the stimulus. These bolts are assumed to be of 
equal “value” (as suggested by the “‘all-or-none law’’), and to be ran- 
domly distributed in time with a mean density which is proportional 
to the effective intensity of the stimulus. A series of such pulses, on 
passing into the central nervous network, is pictured as having a 
certain cumulative effect on the state of the organism, which, on sur- 
passing a certain threshold value called », gives rise to the pattern 
of responses or adjustments appropriate to the stimulus. This cumu- 
lative effect (or “value’”’) of such a series of pulses is supposed given 
by a sum 


sas | ali de (2) 
where ¢ is the present time, + refers to times of occurrence of the 
pulses making up the series, a is the “value” of each pulse, and 
4(t—+7) is a certain function of the time (t—+7) elapsed since the 
occurrence of a pulse, which may be termed a “retention factor.’”’ Some 
elementary considerations on the possible nature of the mechanism 
by which the effects of the pulses accumulate indicate that y(¢— 7) 
should have the form 


g(¢—r) =e”, (3) 


which in fact represents a probability that a given pulse, after ran- 
dom (multiple) wanderings in the central nervous system, will hap- 
pen to reénter the response-circuit at the same time that later (fresh) 
pulses are also entering this circuit. We shall term the quantity « in 
equation (2) the awareness. 

Now let the intensity of a stimulus be an arbitrary bounded in- 
tegrable function of time, and suppose that the probability distribu- 
tion of density of pulses in any short time-interval is the Poisson 
type, with expectation St proportional to the intensity of the stimulus. 
This amounts to conceiving the pulses as distributed in time as dust 
particles distribute themselves, by random processes, along a wire; 
the expectation represents the mean density if the intensity is con- 
stant. On this supposition it is not difficult to compute a generating 
function for the moments of the distribution of the quantity e« (equa- 
tions 2 and 3) and to obtain therefrom the mean, variance, and other 
characteristics of the actual distribution. 
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A remarkable result is that, for constant % , the mathematical ex- 
pectation « of the quantity « turns out to be proportional to %, and 
the standard deviation proportional to XR. This suggests an interpre- 
tation of the results in terms of the distinguishability of stimulus in- 
tensities. If we imagine that the e’s of two intensities must be sepa- 
rated by an amount proportional to, say, the root-mean-square of their 
standard deviations, in order to be distinguished, then we obtain a 
physiological intensity-scale whose measure is 2t' or (3, + N.)! instead 
of log 9% as in the Weber-Fechner formulation. This really shows 
better agreement with data on visual discrimination than does the 
Weber-Fechner scale; for one thing, the Weber ratio AN/N of discrim- 
ination increment to intensity becomes infinite when 3t > 0, where- 
as AN/N, the ratio suggested by these considerations, approaches a 
finite value for % — 0. This idea has been tried out on numerical 
data, with, some preliminary indications of reasonable success; the 
writer will report at a later date on what correspondence can be 
found.* 

2. Derivation of the retention factor. The retention factor (3) 
mentioned in the last section would seem to appear reasonable in its 
own right, without a derivation, as an approximate description of the 
relative effects on present behavior of equivalent past experiences 
variously removed in time. If pushed for an explanation, one might 
easily invent various modes of justifying the use of such a form. How- 
ever, the following simple picture of the “carry-over” process seems 
to be speculatively interesting and is presented for whatever it may 
be worth. 

For definiteness, think of a nervous pulse as eventually culminat- 
ing in the contractions of one or more muscle fibers in the sensory 
organism. We may think of the quantity a (equation 2) as the imme- 
diate “awareness” value of each such “twitch.” On thus contracting, 
a muscle fiber in general becomes a new source of sensation, and the 
new pulse thus set up may, in the course of its (perhaps multiple) 
wanderings, find its way back into the same sensory channel as that 
which led to the original contraction or into an equivalent sensory 
channel, in which case the contraction will be repeated by the same, 
or an equivalent, muscle fiber. Similarly the contraction may recur a 
third and a fourth time, and so on; on the other hand it may fail to 


*This is not to imply that Fechner’s scale is universally incorrect: for in- 
stance, it is an accurate description of scales of frequencies, such as a musical 
scale, where it has an obvious rational basis. But such a rational basis for 
Fechner’s scale is not apparent when one deals with stimuli which are measu 
in terms of energy-intensity or, for instance, concentrations of chemical sub- 


stances. 
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recur on any one of these cycles, due to the failure of certain syn- 
apses to transmit the pulse. 

Now assume a constant probability 7(< 1) that the proprioceptor 
pulse arising from contraction of a muscle fiber in a given muscle, 
will find its way back into a channel which will lead to the contraction 
of a fiber in the same (or an equivalent) muscle. Let & be the (aver- 
age) time interval required for completion of such a cycle. Then, of 
Q original contractions, an average number Qr will repeat themselves 
after one interval 6, Qr? at the end of 26, Qr? after 36, and so on; so 


oe 7. & has elapsed, the aver- 


age number of fibers which will still be in contraction, due to the origi- 
nal Q, will be 


that after an amount of time t — 7 = 





2 (t-7) gees 
@ @ 


Qr =Qe , (4) 


where 
o 


>0, sincer <1. 
log r 





This is the required retention factor. 


2.1. Possible correlation of w with the complexity of the neuro- 
muscular organism. It appears reasonable to expect that, in general, 
the probability 7 of a “repeat” of one of these muscle-twitches in an 
organism should show an inverse relation to the complexity of struc- 
ture of the organism. That is, r should be less for man than for the 
ant, for example. Hence the quantity (—log 7) should increase rap- 
idly with increasing complexity, since r < 1. On the other hand, the 
average repeat-period 6 can hardly be expected to vary by more than 
a few-fold among different organisms, depending on average lengths 
of dendrites and axons, and numbers of synapses, in reflex arcs. 
Hence the parameter in the above retention factor should exhibit a 
general diminishment with advance on the evolutionary scale. This 
must mean that the overt behavior of the higher organisms should 
exhibit less “instinctiveness,” and be more dependent on specific in- 
creases of r due to conditioning and habit-formation, than is the case 
among the lower organisms. 

Such a conclusion appears to describe the actual state of affairs. 
The ant, for example, exhibits a highly complex social behavior which 
appears to be almost completely instinctive or automatic; while man, 
also a gregarious organism, exhibits practically no elements of his be- 
havior which do not require conditioning for their establishment, or 
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which may not be modified or reversed by conditioning. Even the sim- 
ple “pupillary reflex,” I believe, has been observed to act in reverse 
in some 30% of a group of new-born infants, the pupils enlarging 
themselves instead of contracting on the first exposures to bright 
light. The pupils of course soon reversed their original responses, and 
the proper responses were quickly established. 

On the other hand, there is an artificial preparation which ex- 
hibits behavior corresponding to a value of 7 very nearly equal to 1. 
The center portion of a jelly-fish was removed, leaving only a circular 
periphery of muscular tissue; and all sense organs but one, con- 
nected with this periphery, were removed. The stimulation of the 
sense-organ resulted in two waves of contraction in the muscle tis- 
sue, one traveling in either direction from the point of stimulation. 
One of these waves was blocked; the other was then observed to con- 
tinue its course undiminished round and round the periphery for an 
indefinite length of time. This corresponds to 


See 


a ee ee ee =1 
@ 


in equation (2) (4). 


3. The distribution function of awareness. We seek now the 
frequency distribution of values of the “awareness” « given by equa- 
tions (2) and (3): 

: | 
“wa (5) 
(T) 

corresponding to an arbitrary intensity-function 9(7) subject only 
to the requirements of being uniformly bounded and Riemann-inte- 
grable. As stated earlier, (7) is supposed to represent the mathe- 
matical expectation of time- density of occurrence of the a - responses, 
at any time -. 

Consider any short interval of time Ar, open at its beginning- 
point z, closed at its end-point ;-++ Ar. Suppose that during this 
short interval of time Q primary “twitches” occur, due to the stimu- 
lus acting, at points r,r®,.--, + within this interval or at its end 
point. Then the contribution of this interval of time to the total pres- 


ent value of ¢ in (5) will be 


1 1 
- — (t-7(™) - — (t-7) 


Q 
Ae=a>de” =aQe °* ; (6) 
n=1 


where 7 is some point in4A;. This substitution is possible because 
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1 
-— (t-T) 


of the continuity of e ” in 47. On the other hand, suppose that 
N(zr) does not vanish at any point indr (= closure of 4r) , that is, 
Nr) is positive (and bounded) throughout 4A7;. Then the (Poisson) 
probability distribution of Q is given by 


(MN Ar)? e-RAt 
Q! 


where ® is a value of %(r) in the interval 47, intermediate between 
its two extreme values. Then the probability of obtaining a certain 
sequence Q, , Q.,--:, Qn of Q’s corresponding to a set of such inter- 
vals A,;r, Aeor,---, Amz which do not overlap, is 


B(Q) = , (7) 





m _3 Nn An 7 
‘ IT (Nn Ant) +e *™ 
a(Qn) = I Bn(Qn) = , (8) 


- TI (Qn!) 





and the corresponding contribution of these intervals to the sum (5) 
is, from (6), 


1 
= efter gM) 


D4Anre=aDQe ” . (9) 

Now consider an arbitrarily long z-interval —T< + = t. Since 
Nz) is uniformly bounded and integrable, it follows that, given « > 0, 
there exists a countable set of semi-closed (as described above), non- 
overlapping intervals A,7 , which cover “almost all” of (—T7, t) , and 
such that, for some 7, in Ant, |M,—MN (tr)| < &, where N,, is the “in- 
termediate” value of 3 used in (8). (“Almost all” means all except 
possibly a set of points of measure 0.) 

The “characteristic function” of an arbitrary cumulative dis- 
tribution function f(x) with respect to the stochastic variable x is 
defined by the Stieltjes integral (5), 


Jif) =F) = Jer af (a), (10) 
if this integral exists. In the present instance, corresponding to any 
finite “cut” of the sequence of intervals A,r, the characteristic func- 
tion for the probability distribution of the variable 3 A,e (equation 


n=1 


9) will be given by [the notation exp(x) = e*] 





at 


) 
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nn 
o ) II (Nn Anr)* m -- (te) 
Pi (y) => 2S = -exp(> {tyaQ,e ° 
Q1=0 Q2=0 =Qm=0 n=1 


II (Qn!) 
(11) 
—N,Anz}) == =-:-d (1 on(Qn)}, Say. 


(Q1)(Q2) = (Qm) N=1 


This last expression contains exactly one term corresponding to each 
different set of values of (Q,, Q.,-:-, Qn), Qn = 0,1, 2,--- ; hence 
by rearranging the terms we can obtain 


Pp‘ (y) = L( 3 40(Q) ) 


2 
- — (t-79() 


m © yy ‘ hn ro) Q 
=| 5 ae exp(iy ae )J 
Gr=0 Qn! 





> e-rs | 


n=1 


m _ - — (t-T.) 
= ex(3 N,[exp(iyae ° )—1]) ae 


Now as described above, the intervals A,7 were selected so that we 
may place N, = (tm) + dn, where 7, isa point of A,r, and |dn| < e, 
where e > 0 is arbitrary. Let m > o. so as to cover “almost all” of 
the interval (—7',t). Then, letting 4,7 ~ 0, we get 


x 
S atte 


t T) 
log P'™ (y) = |, 2(7) - [exp(iyae ° ) —1)d-7+R, 
where (12) 


t ae 
IR| <|e f [exp(iyae ®  )—1]dr|>0fore 0; 
-T 





whence R = 0, since « is arbitrary. Finally let T > o and we have 
for the characteristic function P(y) of the probability distribution 
of e (equation 5) 


1 


¢ - — (t-7) 
log. P(y) = [/RG) - Lexpliy ae head ee (13) 


which determines uniquely the character of the distribution of «. 
From (10) it is evident that successive moments of ¢ about « = 0 


are given by 
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k*® moment = uw, = (—)* eos ’ 


whence we easily obtain for the arithmetic mean ¢ = y,, and for the 
variance o*7. = 2 — u’; the following: 


; i acme Shand 
e=maa fe N(zr)dr, 
-0o 


(14) 


ncn (t-1) 
oe = n— 1 = a8 ['e ® Nir)dr. 
-0 
If % is constant we therefore get E~ N, ce W N!, as remarked above 
(§2). 
The first four terms of an expansion in Hermite polynomials 
(Gram-Charlier series), representing the probability frequency dis- 
tribution of e, are easily obtained (6): 


1 : toon a3 
p(e) ee (nds | 


<I 


JO 


«| fie “s* T) minds 
(15) 





- — (t-T) 5 
1 wo [ — Ris) d+ 2. 
~~ 420 


ony ~ 





_ (e-€)? 
20 2 
1 a Oe : ) 


where « and o- have the values shown in (14). The derivative-sym- 
bols d"/de" of course are supposed to operate on the “errcy function” 
shown after the last brace. 


4. Case of two stimuli which inhibit each other. Suppose an or- 
ganism is subject to the operation of two different stimuli which call 
for “opposite” (mutually contradictory) responses. Denote by ¢, and 
é, the “awareness” corresponding to these respective stimuli; then 
imagine that the likelihood of the organism responding to e, (+ — 
response) or to e, (— — response) depends simply on the chance 
(positive or negative) value of the difference 


&y = & —&. (16) 
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Let r(€0) represent the frequency distribution of «,. Obviously 
(co) = [“Ps(e2) -Pilco +e) der. (17) 
The characteristic function for r(e) is easily obtained (5, 275) (plac- 


ing ¢. = e’, and noting that thus P.(y) becomes P,(—y) with respect 
to e’.): it is given by R(y), as follows: 


log R(y) = log P,(y) + log P2(—y) 


1 
t - — (t-T) 
= [ (expliyae 1 ) —1}Ri (7) dz 











(18) 
1 
t 7 -— (t-7T) 
+ | {exp(—iyae @ ) —1} N(r)d-. 
In this case we then obtain as before 
1 1 
= r ¢ -— (t-7) $ —~— (#1) 
& —a fe ™ N,(r)d 7 — fe ” MN. (r)d + 
: ; (19) 
2 2 
af of ¢ -— (t-7) ¢ -— (t-7) 
ote =a [ 8 Rede f e@ 2% Ry (r)dz 
and finally 
1 _(€0-E0)2 
1 (€) = r(e, —= &2) = ———— ¢ te 
V2a Ceo 
(20) 
4 as 1 f eae (t-7) a ss ied d 
a + aa 2 z — a1 1 
6 =| a oe fe eens 
a (€0-E0)2 
x ae’, ad is wi 


5. Discussion: The quantity «, whose distribution is defined by 
(13) and given by (15), represents momentary “awareness” with ref- 
erence to an arbitrary stimulus which acts on the organism. If the 
stimulus is such as to call for a particular form of adjustment (pattern 
of response) of the organism, it seems reasonable to assume that such 
response will take place, or fail to take place, depending on whether ¢ 
at any particular instant exceeds a certain positive quantity » , which 
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may be termed the “threshold of perceptibility.” If the response once 
sets in, we may suppose it will be carried to completion. Thus, given 
an arbitrary large group of individuals in a certain stimulus situation, 
suppose y represents the number which at any time ¢ have not yet 
begun a response. Then evidently on the average a number 


—dy=ydt. [ p(ede (21) 
L/] 


of these individuals will commence response during the (short) ensu- 
ing time-interval dt. 

On the other hand, in the case of conflicting stimuli ¢«, and «, 
(§4), consider a response to ¢, as a positive response, one to «, as 
negative. If neutrality or indecision is possible, then a negative re- 
sponse occurs if «, < — », the organism remains indecisive for 
— » = & =7, and a positive response occurs for e, > 7. Thus in 


this case 
dy” — 
—S =f r(€)d & 
P (22) 
y 


a 2 =v T(E) d &. 


The writer plans to make use of these general ideas in a deduc- 
tive theory of schooling behavior of sardines, now under preparation. 
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THE COMMON FACTORS IN FIFTY-TWO MENTAL TESTS 


HERBERT WOODROW 
University of Illinois 


An analysis by Thurstone’s centroid method of the intercorrela- 
tions of fifty-two tests was carried to ten factors. Included were tests 
of social intelligence, Philip’s attention tests, and Seashore’s tests of 
musical ability. After rotation of axes, the most important factors 
appeared to pertain to operations conventionally alluded to by the 
following terms: verbal facility; spatial ability; numerical ability; 
attention; musical ability; and memory (or memory span). The so- 
cial intelligence tests proved to be mainly tests of the verbal factor. 
A factorial sex difference was indicated by the superiority of men in 
tests of spatial ability. 


Here reported are the results obtained through a factor-analysis 
of 52 tests by means of Thurstone’s centroid method. Their chief val- 
ue probably lies in their bearing on the abilities involved in tests of 
social intelligence, attention, temporal discrimination, and musical 
ability. 

The factor-loadings have a high probable error, because of the 
small number of cases, 110, and the use of tetrachoric correlations, so 
that the results need to be interpreted conservatively. It is becoming 
rather customary to consider a loading to be significant only if it 
exceeds three times the standard error of an original correlation of 
zero. According to this criterion, in the present case loadings less 
than -++.45 are not significant, since the standard deviation of a tetra- 
choric correlation of zero, when based on 110 cases, with division into 
dichotomies at the median, is +.15. 

The group of subjects* was composed entirely of freshmen and 
sophomore students. In chronological age, the median was 19.9 years, 
and the range, 18.0 to 23.2 years. The group was composed of 55 per- 
sons of each sex. In order to eliminate, or at least nearly so, the ef- 
fect of sex on the correlations, each individual was scored as above or 
below the dichotomic line of his own sex (taken as near the median 
as avoidance of tied scores would permit). Thus, men above the men’s 
median and women above the women’s median in a given test were all 
marked plus, even though the medians of the two sexes were differ- 


* The tests were given in the spring of 1934 to students employed under the 
National Emergency Relief Administration. The long time required to complete 
the analysis is due mainly to the fact that the analysis was repeatedly begun 
anew on account of new developments in factorial methods. 
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ent. In only two tests did the occurrence of tied scores lead to the 
choice of a dividing line outside the limits of the 40th and 60th per- 
centiles. 

The centroid analysis, made in accordance with the procedure 
described by Thurstone in The Vectors of Mind, was carried through 
ten factors. The communalities were taken in the case of each vari- 
able as equal to the highest correlation shown by that variable with 
any of the others. A new estimate of the communality was made in 
the case of each new matrix of residuals. The original correlational 
matrix containing the 1,326 test intercorrelations, the original fac- 
torial matrix, and the orthogonal transformation matrix will not be 
presented, since all these tables* are but means to the final outcome 
shown by the rotated factorial matrix constituting Table I. The sum 
of the original first factor loadings was 23.37, while the sums of the 
loadings} of all remaining factors were within the limits +.10. The 
transformation of the factors was carried out graphically by rotating 
one pair of axes at a time, and endeavoring always to maximize the 
number of insignificant loadings. The number of rotations of pairs 
of axes required was 119. The distribution of the residuals, or differ- 
ences between the original correlations and the correlations which 
would result solely from the ten factor loadings, was fairly symmet- 
rical, with a mean of —.005 and a standard deviation of .086. 

The 52 tests were given in the course of 10 periods of approxi- 
mately 1 1/2 hrs. each, and in the same order as that in which they 
are listed below. A few minutes of fore-practice, during which ques- 
tions were encouraged, was provided for every test. Since tetrachoric 
correlations were to be used, it was decided not to call “stop,” in the 
case of time-limit tests, until five or six students had indicated by 
raising their hands that they had finished. The time allotted was 
therefore determined at the time the test was given. This procedure 
was practicable only because the entire group was tested simultane- 
ously. Most of the spatial tests (Nos. 9, 17, 19, 20, 28, 42, 47, and 48) 
and test No. 15, verbal classification, were identical with those used 
by Thurstone and are included in the recently published supplement 
to his monograph on Primary Mental Abilities (3). 


* Mimeographed copies of the three matrices will be furnished upon receipt 
of request addressed to the Department of Psychology, University of Illinois. 

+ Theoretically the algebraic sum of the loadings of all factors after the first 
should be zero. In order to obtain this result, within the limits of +.10, and at 
the same time maximize the amount of variance accounted for by the factor, 
slight deviations from the sign-changing procedure described by Thurstone were 
in some instances found necessary. After one half the signs of all tests had been 
made positive, study was made of the effect of further sign reversals, and one or 
two further reversals were made if it was found that they increased the total 
variance accounted for by the factor and at the same time brought the algebraic 
sum of the factors nearer to zero. 
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List of Tests 


The first three tests were from the American Council on Educa- 


tion Psychological Examination, 1930 edition. The tests of the 1932 
edition were used for fore-practice. 


1. 
2. 
3. 


CHAR AE 


10. 
ai. 
12. 
18. 


14. 
15. 


16. 
a. 


18. 


19. 


21. 


22. 


23. 


24. 
25. 


Completion (of sentences constituting definitions). 8 1/2 min. 

Form analogies. 8 1/2 min. 

Opposites and similarities. Detecting pairs of words (in sets of four) with 
either the same or opposite meaning. 5 1/2 min. 

Tests 4 to 8 inclusive constitute the “Social Intelligence Test,” revised form, 
first edition, prepared by Moss, Hunt and Omwake. 

Judgment in social situations. 9 min. 

Recognition of the mental state of the speaker. 4 min. 

Observation of human behavior. 5 1/4 min. 

Memory for names and faces. 5 min., plus 4 min. for studying photographs. 


Sense of humor. 6 1/2 min. 

Areas. Counting the number of square units of white surface in irregular 
black and white areas. 8 min. 

Opposites, especially constructed multiple choice test. 7 min. 

Disarranged sentences, especially constructed test. 6 3/4 min. 
Disarranged syllables, especially constructed test. 15 min. 

Disarranged letters (termed disarranged words by Thurstone and others; 
anagrams by Foster and Tinker and others). Rearranging letters to form 
one word from each set of letters. Especially constructed test. 12 min. 
Analogies. Especially constructed multiple choice test. 9 3/4 min. 

Verbal classification. Marking words to indicate to which of two categories 
they belong—the nature of the two categories being indicated only by adja- 
cent lists of words. 10 min. 

Vertical addition (of columns of one place numbers). 10 min. 

Flags. Indicating whether two pictures show the same or opposite sides. 
3 min. 

Lozenge B, Thurstone’s Spatial Relations Test B. Stoelting catalogue No. 
27115. Indicating whether two diagrams show same or opposite faces. 5 min. 
Punched Holes. Marking position of holes made by punching a paper folded 
as indicated by diagrams. 4 1/4 min. 

Pursuit test. Tracing the paths of intertwined lines. 5 min. 

Filled intervals. Discrimination difference limen, based on 120 comparisons 
with a standard of 1.0 sec. Score, 75 per cent limen, in units of .001 sec. 


60 min. 

Tests Nos. 22, 23, 29, 30, 32, and 88 were Seashore’s Musical Ability Tests 
(2), given by means of phonograph records. 

Pitch discrimination. 

Intensity discrimination. 

The Otis Group Intelligence Scale, Advanced Examination, Form A, was 
used for tests 24-27, inclusive. 

Following directions. 5 min. 

Proverbs. 5 min. 
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26. Arithmetic. 5 min. 

27. Narrative completion. 5 min. 

28. Copying (horizontal). Copying four-line irregular figures. 7 1/2 min. 

29. Time. Seashore’s test, requiring comparison of two intervals marked off by : 
three clicks. Score, per cent correct. 

30. Rhythm. Discrimination of pairs of rhythmical sequences of clicks. 

oi. Empty intervals. Discrimination difference limen, based on 120 comparisons 
with a standard of 1.0 sec. Score, 75 per cent limen, in units of .001 sec. 
60 min. 

82. Consonance. 

33. Tonal memory. 

84. Reaction time. Simple reaction time to sound, measured by a Hipp’s chron- 
oscope, with 2 sec. preparatory interval. Given individually. Score, median 
of 50 trials. 

35. Dot counting. Especially devised test requiring counting the number of dots 
in 16 horizontal lines each composed of about 25 groups of irregularly ar- 
ranged small dots, the groups varying in number from 3 to 5. Score, num- 
ber attempted. 9 1/3 min. 

Tests 36-39 and 41 consisted of Philip’s “attention” tests (1). 

36. Number span. Auditory memory span for digits, when digits are inter- 
spersed with consonants. Score, total number of digits correct. 

387. Horizontal addition. Philip’s test modified by making it longer and equal- 
izing difficulty of the lines. 5 min. 

38. Alphabet. For example, if examiner dictates d 2 3, the subject at a signal 
following a pause should write f 7. 

39. Mental multiplication. Answers written upon signal “Write.” 

40. Vertical copying. Like test No. 28, except that diagrams to be copied were 
exposed on large vertical charts at the front of the room. 15 min. 

41. Cancellation. Philip’s, multiple instruction. Score, number of correct can- 
cellations. 12 min. 

42. Block ccunt. Determining from line pictures of irregular piles of blocks 
how many blocks touch each of five lettered ones. 9 min. 

3. Skeleten words. Especially constructed test, requiring the subject to fill in 
missing letters in order to make words. 9 min. 

44, Substitution. Woodworth-Wells. Requires pairing a digit, supplied by a key, 
with each of 5 geometrical forms printed in irregular order on a page of 
100 forms. 

45. Memory span for digits. Score, total number of digits correct. 

46. Lozenge A. Thurstone’s Spatial Relations Test A. Stoelting catalogue No. 
27114. Indicating where the hole in a card would be if the card were turned 
over and rotated to a given position. 5 1/4 min. 

47. Cubes. Determining whether two drawings of a cube with figured faces 
could possibly represent the same cube. 8 1/2 min. 

48. Paper form-board. Thurstone’s. 10 min. 

49. Simultaneous letter-span. Perceptual span for simultaneously exposed let- 
ters. Given individually. Score, total number of letters named correctly. 
30 min. 

50. Word-Building. Score, number of words composed from one set of 6 letters. 
Two tests, 3 1/4 min. each. 

51. Digit cancellation. Stoelting catalogue No. 27011. Cancelling digits 2, 4, 
and 9. Score, number of correct cancellations. 4 3/4 min. 

52. Speed. Making crosses in each square of a cross-section sheet. 6 min. 
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The loadings of each test with each of the factors finally reached 
after many rotations is given in Table I. The loadings are those with 
uncorrelated, orthogonal factors. 


TABLE I. Factor Loadings after Rotation of Axes 


The factors are designated by the Roman numerals I to X. 
h2 = proportion of the total variance of a test due to the ten common fac- 
tors, i.e., the sum of the squares of its factor loadings. 














mm 8 II Ill IV Vv VI VII VIM Ix x m 
1 176 207 «=.048. S008 «075s 211 —.026 = 281 —.157 = 125817 
2 209.785 201s «154 —.018 009 «156 —.122 —.106 —.262 .843 
3 ‘770 .254 085 —.052 029 .220 —.083 .020 .006 —.222 766 
4 175 057 —.145 —.018 —.014 —.052 093 —.085 194 —.102 — .692 
5 552 © .229 —.0838 —.082 —.060 .103 .106 518 —.015 —.019 - .665 
6 847 —.128  .020 —.019 —.017 —.023 185 153 109 .265 875 
7 .154 501 —.192 —.040 083 «= .210 —.022 —.192 146.272 7.496 — 
8 856 «6.083 «S061 Ss «062 ~—.040 = 039 —.177 —.046 085 —.085 —.798 
9 —034 534 .448 ~.263° —.064 374 —.058 —.177 —.096 —.103 — .749 
10 =—«.911 019 —.061 059 —.090 267.231 001 185058 «998 
11 826 055 «= «154. 088 004. —.118 —.110 —.238 051 —.126 809 
12 410 800 369 =.017 = «180 = .289 «006 S 248.280 —.255 - .689 
13 208 348 231 .888 —.070 .276 —.069 —.007 .175 —.165  .508 
14 471 «=-«.698.s «101.s«024 ~ —.075 = 069 072 ~ —.008 —.023 269 —.801 
15 690 .298 —.155 .346 064 .208 .104 —.015 —.066 —.028 | .772 
16 238.101.7538 ~—.210 = 087 097-116 —.002 —.206 049.744 
17 —.064 .732 116 .255 .078 —.036 037 .054 —.092 .029 ~-.639 
18 098 .454 —.014 346 —.072 —.159 011 087 —.195 —.104 ~ 1.423 
19 044 = 820. «082,284 —.086 «= 056 = 048.279) 178 127 897 
20 —.287 .800 847 086 .350 .119 .106 .276 —.003 020  .499 
21 110.064. 189 —.258 460 = 090.084 = 202 —.237 244.96 
22 —.0938 029 .164 .214 .757 064 —.215 .086 135 —.005 .731 
23 —.057 088 —.196 .296 .590 —.043 .196 —.095 —.199 .054 571 
24 363.108.8389 «46T = «007-078 = «259 279 «001 = 128.678 
25 668 038 —.028 .196 067 051 .104 506 —.048 .128 779 
26 294 800.358 «97 —.044 —.138 156 —.022 —.2838 —.187 .792 
27 461 175 428 —.069 —.093 221 —.006 —.061 158 318 614 
28 074 = .606 —.085 092, «345.057 —.114 248) 355 162.782 
29 044 —.060 119 —.176 522 044 .855 085 —.074 —.220  .512 +> 
30 168 —.118 172 .120 .314 150 —.080 .105 —.134 —.292  .326 
31 259 ©6185 —.052 —.023 503 = 022-649 048 008 015.766 
32 113 —.080 148 104 .618 —.074 —.024 —.286 194 —.061  .568 
33 —150 .015 .258 .110 .708  .053 —.028 .059 .147 —.006 . .681 
34 057 = .268 ~——.004 —.250 830 —.022 277 —.051 .255 182 _ .424 >” 
35 199 —.068 .722 127 .288 040 —.125 —.101 —.213 075 = 717 
36 308 —.146 267 = 067 —.092 .728 068) 052 161 —.092 —.765 
37 —.015 .210 819 112 025 —.030 .205 .109 074 —.055 791 
38 025 .059 «185 606 = 825 «382 —.138 —.036 —.098  .229 728 
39 —.171 164 434 185 —.065 .390 .298 —.107  .068 —.250 - .602 
40 019 369 —.126 632 —.050 278 —.041 .222 187 060 = .722 
41 146.824. 278 = 4438 —.049 091 «187 —.196 212.094 ~ .538 
42 137.801 «121 —.060 —.158 —.057 .141 —.009 302 —.288 —.901 
43 826.889 439 —.105 = 088.209 —.196 = 147 —.229 —.042 _ .618 
44 156.286 = 410 «303 055.269) 280 —.028 —.028 811 — .617 
45 —.004 081 072 —.054 —.121 .746 —.156 002 —.090 .124  .637 
46 035 = 853 088) 852177: 221 087) 075 1438 —.123 991 
47 301 611 064. «337 —.049 —.150 096 —.011 —.144 1174-667 
48 —.026 .787 —.012 .819 028 098 —.263 .200 .324 —.049 .950 
49 —.216 .175 —.248 833 .216 —.122  .021 .110 .239 142 898 « 
50 845 282.860) 6192) 076 = 414 —.190 —.188 —.261 .193 ° 720 
51 225 —.153 518 .298 —.198 —.031 .025 —.079 .246 .273. .606 
52 058 —.072 —.055 301 .494 —.108 024 .038 —.083 441 561 
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In interpreting Table I, the high unreliability of the factor load- 
ings should be kept in mind. Probably the safest procedure would be 
to consider the results inconclusive whenever the factor-loadings are 
in the neighborhood from .25 to .45 and to consider as positive only 
instances where the obtained loadings are +.45 or over. Only the first 
five factors show five or more tests with loadings as high as -+.40, 
Two other common factors, however, VI and VIII may be psychologi- 
cally meaningful, though factor VIII, conspicuous in only two tests, 
admittedly presents some difficulty. Factors VII, IX, and X are large- 
ly superfluous, in that none of them show loadings as high as +.40 
on the part of more than one test. Probably none of the factors con- 
stituting the rotated reference frame are meaningless in the sense of 
revealing unquestionably significant loadings on the part of a hodge- 
podge of tests to which no common factor could plausibly be attrib- 
uted. 

In order to set forth more distinctly the nature of the common 
factors exhibited by Table I, tests with loadings over +.40 will be 
listed by factors. 


Factor I (Verbal) 


1. Completion (of definitions) ...... Sf SMe | Se. Co Co. naa ae 911 
3. Opposites and similarities ........ .770 11. Disarranged sentences ............ 826 
4. Judgment in social situations.. .775 12. Disarranged syllables .............. 410 
5. Recognition of mental state of Be PAWENORIAES shh ats AT71 

SRRC I HINOT oc sacs scence eos .552 15. Verbal classification ................ .690 
6. Observation of human behavior .847 25. Proverbs ................:..:cccccseceeeeeeeees 668 
8. Sense of humor -........-........2...:.. .856 27. Narrative completion .............. 461 





It seems clear that this factor is a verbal factor. It agrees fairly 
well with Thurstone’s factor V (verbal relations). It is noteworthy, 
however, that three other tests dealing with the composition of iso- 
lated words have relatively low, presumably insignificant, loadings 
with this factor, as follows: No. 43, skeleton words, -+.326; No. 50, 
word-building, +.345; and No. 13, disarranged letters, -+-.208. Accord- 
ing to Thurstone’s results (8, p. 84), it might be expected that these 
three tests, together with No. 12, disarranged syllables, which has a 
loading of -+.410 with factor I, would all show significant loadings 
with a word-factor, W. Why such a second verbal factor was not 
found in the present study is not clear. 

Of major interest is the fact that tests of “social intelligence” 
appear to be for the most part tests of the verbal factor. With the 
exception of the test for memory for names and faces, which turned 
out to be primarily a visual or spatial test, all the social intelligence 
tests are primarily tests of verbal ability. This fact does not neces- 
sarily prove that there is no special factor involved in successful so- 





oad- 
d be 


are 
only 
first 


Ogi- 


Sts, 
20. 


ib- 


on 
be 








HERBERT WOODROW 105 


cial behavior which may be tested by pencil and paper tests. It is 
conceivable that factor VIII represents such a factor. 


Factor II (Visual-Spatial) 




















2. Form analogies ....................--- -785 19. Punched holes .820 
7, Memory for names and faces.. .501 28. Copying, horizontal .................. .606 
9. Areas .584 42. Block-count 801 
14. Analogies 1698 4G, Wezenge A. nc. ..0c. essen nsesceceeen .853 
17. Flags pr VARS (MR: C22) ne ee 611 
18. Lozenge B .454 48. Paper form-board ..................... -787 





There can be little doubt that this factor is a visual or spatial 
factor. The most puzzling loading in the entire table, however, occurs 
in connection with this factor, that of +-.693 on the part of the verbal 
analogies test. It can perhaps be explained in the same way that 
Thurstone explains a loading he obtained with this factor in the case 
of test No. 15, verbal classification, namely, by reference to the sup- 
posed helpfulness of visual imagery in the performance of the test. 
In the case of three other tests, one might possibly expect higher 
loadings. These are the following: No. 20, pursuit, + .300; No. 26, 
arithmetic, + .300; and No. 40, vertical copying, + .369. The last of 
these seems very clearly to be a spatial test. As pointed out above, 
however, loadings of a magnitude shown by these three tests are in- 
conclusive. That Test No. 7, memory for names and faces, which is 
one of the “social intelligence” tests, should be a test of this factor, 
rather than of factor VI, a memory factor, is what might be expected 
if factor II is interpreted as visual-spatial. 


Factor III (Numerical) 








9. Areas .448 89. Alphabet (Philip’s test) ........ 434 
16. Vertical addition ...................... -758 48. Skeleton words .....................+.++ 4389 
27. Narrative completion .............. .428 44. Digit-form substitution .......... 410 
35. Dot counting .722 51. Digit cancellation ...................... 518 
37. Horizontal adding .................... 819 


The three highest loadings with factor III are shown by three 
adding tests, so that the factor is presumably a numerical one. It is 
difficult, however, fully to make sense of all the obtained loadings. 
Why the test of skeleton words should have as high a loading as 
Philip’s alphabet, which requires much counting, is not clear. Narra- 
tive completion also offers some difficulty. It is true that it involves 
counting in that the words supplied to complete the narrative must 
contain designated numbers of letters. Test No. 1, however, which 
also requires substitution of words of designated numbers of letters, 
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shows a loading with this factor of only +.043. The explanation very 
likely lies in the ambiguous status of loadings between +-.25 and +45, 
such as shown by narrative completion, alphabet, skeleton words and 
digit-form substitutions. Their “true” loadings may be either of 
considerable magnitude or quite inconsequential. 


Factor IV (Attention) 


24. Following directions ................ 467 40. Vertical copying .....................-.. 632 
NS | enna enna eee ereet .597 41. Philip’s cancellation ................ 448 
URS ASS 2 GEER eee ee renee oR .606 


Two of the five tests of factor IV, Nos. 38 and 41, are among 
those devised by Philip in order to measure attention. This factor 
probably deserves further investigation and any name given to it at 
present may be more misleading than helpful. A name applied to a 
factor is at best likely to be misleading and in no case establishes the 
ultimate nature of the factor. It is merely a speculative suggestion 
as to the kind of operations in which the factor is manifested. Factors 
supposedly constitute discoveries newer by centuries than those which 
called into existence such concepts as perception, attention, reason- 
ing, imagination, etc., and, once well established, they should have 
new labels. This is particularly desirable because of the ambiguity of 
most of the older, more or less prehistoric, terms. 


FAcToR V (Musical) 
21. Filled interval comparison ...... .460 31. Empty interval comparison .... .503 
I ees TO1) “Bee MONONA foc csiccccericctcinces 618 
ae EEN 590 33. Tonal Memory ........................-- -708 
UD aro scat dcesaxtnoue .522 52. Speed 494 





This factor appears to be one of importance in musical perform- 
ances. All the tests are auditory except No. 52, the motor speed test. 
The fact that the speed test shows a correlation of +.494 may per- 
haps be accounted for by the assumption that in this group of sub- 
jects those who were superior in the auditory tests also on the aver- 
age made superior speed scores on account of past practice obtained 
in the playing of musica] instruments. 


Factor VI 
386. Number span (Philip’s test) -................... -728 
45. Memory span for digits ..............22...2.-:0:-0+- -746 
Ree CR ESE 414 


There is a suggestion here of a memory factor. 





ery 
45, 
and 

of 


443 


ng 
or 


a 
he 
on 
rs 


n- 
7e 
of 


3 


8 
4 





HERBERT WOODROW 107 


Factor VII 
81. Empty intervals .649 





The second largest loading with this factor, +.355, was shown 
by another test of discrimination of empty intervals, namely, test No. 
29, Seashore’s time test. 

Factor VIII 


5. Recognition of mental state of speaker.... .518 
25. Proverbs 506 





Any interpretation of this factor would be quite speculative. Ex- 
amination of test 25, proverbs, reveals the fact that practically all 
the proverbs and statements included in the test deal with social situ- 
ations. It is not impossible, therefore, that tests 5 and 25 may have 
in common some factor related to a certain type of knowledge. 


Factor IX 


No test loading over +.36 


BORNE ns cps cece er on cease cians ccacvacenassbdansoecneas 441 


Sex Differences. — A quantitative index of sex differences in 
factors was obtained by calculating in the case of each factor the 
mean of the significance ratios (M, — M.)/(PEvx, -u,) of all the tests 
of that factor with a loading over -+-.40. The results for the first five 
factors are shown in Table II. Male superiority is indicated by a plus 
sign. In addition to the means of the significance ratios (the values 
of which are given in the column headed M.S.R.), the table shows 
their P.E.’s and the ratios of the means to the P.E.’s. These latter 
values of course depend largely on the number of tests from which 
the mean significance ratios were calculated. 


TABLE IIT 


Sex Differences 


Factor No.tests  M.S.R. PEys.x, M.S.R./PEyse. 
I 18 + .40 87 1.08 
Il 12 43.85 Al 9.39 
Ill 9 41.35 68 1.99 
IV 5 42.10 81 2.59 


V La i a 02 44 06 
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Table II shows that while the two groups differed very little in 
the tests of factor I, the verbal factor, the men manifested rather sig- 
nificant superiority in the tests of factor II, the spatial factor. The 
men exceeded the women in all twelve tests of this factor, except No. 
7, memory for names and faces, in which case the scores were equal. 
In the case of all other of the first five factors, women exceeded the 
men in at least one test of the factor. 
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ON A SHORTENED METHOD OF ESTIMATION OF MENTAL 
FACTORS BY REGRESSION 


WALTER LEDERMANN 
Moray House, University of Edinburgh 


A shortened method of finding the regression for estimation of 
the scores on mental factors is presented. If the correlations among 
n tests are accounted for by r factors, the regression equation in- 
volves the inverse of an r X r matrix instead of the inverse of an 
n X n matrix. When r is much less than n, there is considerable sav- 
ing in computational labor. A numerical example is presented to il- 
lustrate the method. 


I. Object of the Paper 


Suppose that a factor analysis of a team of n tests has revealed 
7 common factors and n specific factors. As the total number of fac- 
tors is greater than the number of tests, the value of any particular 
factor can be estimated only from the test-scores, the “best” estimate 
being that obtained by the Regression Method. Now the formulae 
supplied by the general theory of multivariate regression always, in 
some form or other, involve the computation of the reciprocal of an 
n-rowed matrix — the correlational matrix of the n tests — , a task 
which, for large values of n, is very laborious indeed. However, in 
the special case in which the tests depend on only one common factor, 
g, an explicit and much more convenient formula is available: 


estimated g = [1+3 = me To (1) 
where z; is the score in the i" test and /; is the saturation of that test 
with the common factor g. The formula (1) is due to C. Spearman’, 
and its consistency with the general regression equation has been 
proved.+ Professor Godfrey H. Thomson pointed out to the present 
writer that it would be desirable to generalize formula (1) so as to 
cover the case of several common factors whose number according 
to L. L. Thurstone’s theory is assumed to be appreciably smaller than 
the number of tests. It is the object of this paper to provide such a 
formula. It will be shown that by a simple transformation the regres- 


* Spearman, C. The Abilities of Man, Appendix §§ 6-8, London, 1932. 
+ Thomson, Godfrey H. “The Meaning of ‘i’ in the estimate of ‘g’,” Brit. J. 


Psychol., 1934, 25, 92-99. 
—109— 
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sion coefficients can be made to depend on the reciprocal of a matrix 
of order r only, and it appears that the total arithmetical work is thus 
reduced. 


II. Transformation of the Regression Formula 


The result of the factor analysis of a team of tests can be sum- 
marized in the matrix equation 


z= Mf, (2) 
where the column vector 
a= {Z1 » 22 5°**, Zn} 
denotes the tests and 


f= {fo fi} = {for » for »°°* » Sor faa» Faas °°* s Fan} (3) 
indicates the (7-+-n) factors, which are assumed to be mutually or- 
thogonal; the first subscripts 0 and 1 refer to common factors and 
specifics respectively. The matrix M , the complete factorial matrix, 
has 7 rows and rn columns of which the first r contain the loadings 
of the tests with the common factors, whereas the last » columns 
correspond to the specifics. Thus 


My, *** My | m,0--0 | 
= Moi °**Meor § O Me: 0 (4) 


The test-scores are supposed to be in standard measure, i.e., we have 
> mip + m5, =1 (t=1,2,---,m). (5) 
p=1 


As is clear from the above notation, the matrix M, is an nr (rec- 
tangular) matrix, and M, is a diagonal matrix of order n. Hence we 
can write (2) in the form 


z=M,f,+M,/., 


exhibiting the parts that are due to common factors and to specifics. 
Now, if we estimate the factors f from (2) by the usual regres- 
sion method, we obtain* 


f=M'R-z, (6) 
the bar being used to denote the estimated values, as distinct from the 


* See e.g. Thomson, Godfrey H., “Some Points of Mathematical Technique in 
— Analysis of Ability,” Jour. Educ. Psych, 1986, 2%, equation (15), 
p. 41. 
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hypothetical or “true” values f. On partitioning f in the same way as 
fin (3) and substituting for M we see that the estimates of the com- 
mon factors (those of the specific factors are of no practical interest) 


are given by 
fo = M'".R“z, (7) 
where Ff, the correlational matrix, is related to the factorial matrix 
by the equation 
R=MM', 
or by (4), 
R = M,M',+ M?,. (8) 


(Since M, is a diagonal matrix we obviously have M’, = M,.) If we 
define the *<r symmetrical matrix J by 


J= M’'~.M,-? M, ’ (9) 
the following identity is readily verified: 
M’».Ro3 = (I+ J)3M'.M,? , (10) 


where J is the unit matrix of order 7. Indeed, we have 
M’,.M,-?(M,M', + M,?) = (M’.M,°M,+1)M",, 


whence we obtain (10) after premultiplying by (J + J)-! and post- 
multiplying by R-?. On substituting (10) in (7) we finally get 


fo = + J)7M'M,"z , (11) 


where now only the reciprocal of an 7-rowed matrix is required. 

In a numerical example one should proceed as follows: Divide 
each, element of the it" row of M, by m;? (we are using the notation 
of equation (4)); this gives the matrix M,°M,, the transpose of 
which occurs as a factor in the right-hand member of (11). Then 
multiply M, into M,?M,, column by column, which yields the 
rr matrix J ; since J is known to be symmetrical, it is necessary to 
calculate only those elements of J which lie (say) on or above the 
diagonal. The computation of (J + J)- and indeed of 

(I+ J)-M’.M,- 
can be carried out by one of the methods specially devised for such 
work, e.g., the Doolittle Method* or A. C. Aitken’s “Pivotal Conden- 
sation Method.” + 


* See Holzinger, K. J. Student Manual of Factor Analysis, Chicago: The 


University of Chicago Press, 1987, pp. 37ff. 
+ Aitken, A. C. “Studies in Practical Mathematics I,” Proc. Royal Soc. Edin- 


burgh, 1986-87, 5/7, 172-181. 











112 PSYCHOMETRIKA 


It is of some interest to verify that, when 7 = 1, our formula 
(11) reduces to Spearman’s formula (1). To conform to the usual no- 
tation we shall denote the (only) common factor by g and write the 
factorial matrix as 





| Lim. 
l, - Ms 
= 
M = | 
| | 
| l, : ew | 
where 7 os 
2; + m?; =]. (¢ = i, 2,-*-, MN), (12) 


according to (5). The matrix M, is then simply the column vector 
l= {l,,b,-++, ky}, 


and the matrix J is the scalar 


l, 
IM*l=Y—_ = 3, 


i i i l,? 


by (12). Similarly, we get 











shi 
‘°.M,-z = 'M,-z = _— 
M M, 2 ne =i i 
Substituting this in (11) we find 
_ l; Ri 
— J = | 
g [1 +2 aim Fe 2j 8 1,2’ 


which is identical with (1). 

In the case r = 2, it is still fairly easy to obtain an explicit 
formula for the estimates of the two factors f,, and f,. by resolving the 
matrix equation (1) into components. Let 

L, , l, pel, 
and 
k, ’ k, f2°2 » Re 
be the loadings with the first and second factor respectively, and let 


=1—/1?— k;? (t=1,2,---,m). 
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Put 
l 
= 2 
. ~ & is 
K = ey (13) 
L, k; 
i= . 
: 5; M;? 
We then get 
Fou = [(L-+A) (14) — we) ((1-4a) SHS wp BEY, 
(14) 
= Bs L; 2; 
fee = [(1-+-4) (1+-e) — *)"{ —, ude + (1+«) > —+}}- 


MM; 


III. A Numerical Example 


We shall illustrate the method outlined above by applying it to 
a fictitious example which was constructed by Karl J. Holzinger* and 
treated by him by means of the usual regression method. In this ex- 
ample we have 7 tests and 3 common factors, denoted by a, f, y. The 
factorial matrix is given by 


7 (5 26 
6 6 28 
5 8 11 
M = [M, : M,] =| .7 6 15 | 
7 6 6 28 | 
A 8 20 
8 .36 








The elements of the matrix, i.e., the specfics, have been determined 
from the condition that the total variance of each test is equal to 
unity (cf. equation (5)). 

For carrying out the computation it is more convenient to ar- 
range the data in a table as follows: 


TABLE I 
1 2 3 4 5 6 7 hae 
= “tu 2 «2 2 2 «8 a eee 
B 5 6 8 
Y 6 6 8 





m|.26  .28 11 15 28 20 386 


* Holzinger, Karl J. loc. cit., p. 32. 
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The columns marked 1, 2, 3,... refer to the seven tests whose load- 
ings with the factors a, £, y are listed in the first three rows of our 
table. The fourth row gives the squares of the specific loadings of 
each test. 

Next, we derive a new table by dividing the first three entries in 
each column by the last entry, thus: 








TABLE II 
1 2 3 4 5 6 7 
a/m2 | 2692 2.148 4.545 4.667 2.1438 2.000 2.222 
p/m? | 1928 2143 7.278 
ym | 4.000 2.143 4.000 


(To save space we are giving only three places of decimals, but the 
calculations have been carried further). We can now compute the 


matrix 
J = M'.M,?M, 


(equation (9)) by multiplying Table I and Table II row by row 
(omitting the last row of Table I) ; i.e., we have to find the quantities 


A ai" ; r ai Bi 
Jaa = > Jos = joe = SD — 


. 5 ge 
Ene ae I= Bas? (45) 


i; Mm,” 
the summation extending over all tests, e.g., 
jaa = (.7) (2.692) + (.6) (2.143) + (.5) (4.545) + (.7) (4.667) 
+ (.6) (2.143) + (.4) (2.000) + (.8) (2.222) = 12.578 ; 
Jag = (.7) (1.923) + (.6) (2.143) + (.5) (7.273) +- (.7) 0 
+ (.6) 0+ (.4) 0+ (.8) 0 = 6.268. 
The numbers (15) are arranged in a 33 matrix, thus: 


12.573 6.268 5.686 
J =| 6.268 8.065 ; (16) 
5.686 6.886 . 


Finally, we have to find the matrix product 
(I+ J)7M'.M,?, 


where the matrix M’,M,~ is given in Table II. Below we indicate the 
work-sheet for this calculation, constructed according to A. C. Ait- 
ken’s method previously referred to. The Doolittle scheme, of course, 
also lends itself to this problem. 

On the left of the central vertical line, rows A,, A2, A; contain the 
elements of the matrix I + J (i.e., the matrix (16) in which each 
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Computation of the Product (I + J)-1M’,M-?, by Aitken’s Method 












































check 

column 
“A, | 18.573 6.268 _ 5.686 —4 24.527 
a, | 1000 462 419 —.074 1.807 
A, 6.268 9.065 0.000 | 14.334 
A, 5.686 0.000 7.886 ~~ 12.571 
A, | 2692 1.928 4.615 
™ 2.148 2.148 | 4.286 
A, 4.545 7.272 | 11.818 
A, 4.667 4.000 8.667 
A, 2.148 2.1438 | 4,285 
A, 2.000 4.000 6.000 
Aro 2.222 2,222 
B, 6.170 —2626 | 462 —1 8.007 
‘B, 1.000 — .426 075 — .162 487 
B, —2.626 5.504 419 = | 2.297 
‘B, 680 —1.128 198 — 
B, 1.153 — .898 .158 413 
B, | 5.174 —1.904 835 3.604 
“= —2.155 2.045 344 284 
B, — 990 1.245 158 A13 
a — 924 3.162 147 2.386 
B, —1006 — 981 164 —1.793 
C, 4.387 ss =< at 1 | 8.577 
C, 1.000 140 — 097 — 228 | 1815 
C. — we 147.110 se 
A — .407 072 187 | — 148 
C, .297 — .052 838 | 1.083 
e. 1.128 505 — .349 | 1.284 
é. 824 232 — .160 | 896 
C, 2.769 216 — .150 | 2.886 
C, —1.368 240 — .166 | —1.298 
D, 265 029 —.191 | 103 
D, 129 «= 6147 — 092 | 184 
D, — 094 867 .068 841 
D, BT — 200 267 | 364 
» | 116 — .080 187 | .223 
ye | — .172 119 631 | 578 
D. 482 — 299 — 312 | — .178 
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diagonal element has been increased by unity). On the right of that 
line are placed the elements of the negative unit matrix. The first 
part of the rows A, to A, is filled up by the matrix M,M,°, ie., by 
the entries of Table II after rows and columns of that table have been 
interchanged. The second part of these rows consists entirely of ze- 
ros. The row marked A, is obtained from A, by dividing each ele- 
ment by the first entry (13.537 in our example) ; the row A, is then 
no longer required. The check column contains the sum of all the ele- 
ments in the corresponding row, but in the course of the work the 
check column is treated in the same way as any other column, the 
check being that each of its elements must at every stage be the sum 
of all the other entries in the same row. 

The elements of B, are found by calculating all those tetrads of 
A, and A, in which the “pivot” 1.000 (i.e., the first element of A,) 
occurs; e.g., 


(1.000) (9.065) — (6.268) (.462) = 6.170 , 
(1.000) (0.000) — (6.268) (.419) = —2.626 , --- 


We then divide each element of B, by the first entry, 6.170, thus re- 
ducing the pivot for the next step to unity; the quotients are listed in 
row B,. The elements of B,, B.,--- are all the tetrads which can be 
formed from A, and A,, A, and A,, --- respectively, the first entry 
of A, always being retained as pivot. 

This process is continued until the space on the left of the cen- 
tral vertical line is cleared of all entries. The required regression co- 
efficients, i.e., the elements of the matrix M,?M,(J + J)-, will then 
be found on the right of that line. 

In conclusion, we should like to emphasize once more that the 
above method will be found advantageous only when the number n of 
tests is considerably greater than the number r of factors. We have 
used the example (n=7,7r=3) merely to illustrate the procedure. For 
small values of n and r a more direct way of arriving at the result may 
sometimes be more convenient. 
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THE METHOD OF EQUAL APPEARING INTERVALS 


F. M. URBAN 


Brno, Czechoslovakia 


The results of J. P. Guilford’s card-sorting experiment by the 
method of equal appearing intervals are analyzed mathematically. 
The constants of the psychometric functions are obtained by process- 
es analogous to the constant process. When these constants are de- 
termined (a) independent of the supposition that the series of 
groups into which the cards were sorted is quantitative in character 
and (b) under this supposition, good agreement is found between 
them. Guilford’s results agree with Weber’s and Fechner’s laws in 


both cases. 


The method of equal appearing intervals has two forms. In ex- 
periments of the first type the subject adjusts one or several variable 
stimuli between given limits A and B in such a way that the differ- 
ences between successive stimuli seem to be equal. In experiments of 
the second type the subject sorts a number of fixed stimuli 2%, 22, ---, 
Im, prepared beforehand by the experimenter, into equidistant groups 
Ys, Yo» -** » Ynw In each case the final result consists of a number of 
groups to which certain stimuli are assigned, but in experiments of 
the first type any value between A and B may be obtained, while in 
experiments of the second type only the values prepared by the experi- 
menter come to observation. This difference in the data requires cer- 
tain changes in their statistical treatment. We will speak in the 
present paper of the latter form of the method of e.a.i. only. It is of 
considerable interest and promises to throw some light on the mental 
process by which we bring a group of objects into serial order. 

One usually lets the subject handle the stimuli and transfer them 
to the proper groups. This procedure is very satisfactory, but is by 
no means an essential feature of the method. We might just as well 
present the stimuli by some mechanical contrivance and let the subject 
indicate orally or in writing the number of the group to which each 
stimulus belongs. The perception of the stimulus releases a mental 
process resulting in the decision that the stimulus belongs to such and 
such a group. The subject must solve this task, but it is immaterial 
whether he indicates his judgment orally, in writing or by bodily 
transferring the stimulus to the proper group. We are free to choose 
between these possibilities on the ground of experimental convenience. 

Sorting a stimulus to a group is equivalent to an equality judg- 
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ment, since it implies the presence of the attributes which decide on 
the membership in this group, and denies the presence of attributes 
which are characteristic of other groups. Besides this formal resem- 
blance there exists a certain psychological resemblance between these 
sortings and the equality judgments in the method of constant stimuli. 
The number of the group to which the stimulus belongs either jumps 
into the mind of the subject, so to speak, at once or it is arrived at 
after one or possibly several other groups have been tried and found 
unsatisfactory. The experiences of the subject, who in comparing two 
stimuli passes an equality judgment, are very much the same. 

An individual experiment is not finished before the stimulus is 
assigned to a group, i.e., when an equality judgment is given. In this 
the method of e.a.i. resembles the method of average error, in which 
the subject has to find a stimulus which is equal to the standard. The 
use of the equality judgments is essential in the method of just per- 
ceptible differences too. From this it follows that there cannot exist 
a simple relation between the results of these three methods and those 
of the method of constant stimuli, if in the latter we suppress the 
equality judgments by instructing the subject to choose between the 
judgments smaller and greater only. Neither can we obtain an in- 
sight into the methods of physical measurement, if we eliminate the 
equality judgments in our analysis of the methods of psychophysical 
measurement. Physical measurement aims at finding a value which 1n 
the perception of the observer is equal to the quantity to be measured 
and the whole process becomes meaningless, if we eliminate the 
equality judgments. 

Persons who have no previous experience as subject, in experi- 
ments of this kind, need some guidance by the experimenter. Some 
subjects even doubt their ability to find a satisfactory arrangement. 
It is best to show them first the limits A and B of the interval to be 
divided by equal steps. One then lets the subject play with a series 
of appropriately chosen stimuli and after some failures the subject 
succeeds in arranging a satisfactory series. He then must make him- 
self familiar with it by some preliminary experiments during which 
frequent recourse may be taken to the standard series. In some fields 
of work this training succeeds surprisingly well. The subject soon 
becomes familiar with this standard and then places considerable con- 
fidence in his determinations, though he may have doubted at first the 
possibility of such groupings. To an experimenter, who knows the 
very real difficulties of these experiments and is acquainted with the 
variability of the results, the sudden growth of this confidence is a 
very curious observation. 
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We know little about the mental process by which the subject 
decides on the group to which the stimulus belongs. The time it takes 
varies considerably and we may presume that there are important 
differences between these processes in different experiments. Their 
classification should begin with a purely qualitative description of the 
mental experiences of the subject. It will be best to work with sub- 
jects practised in introspection, the emphasis on obtaining introspec- 
tive evidence on the mental process which is released by the perception 
of the stimulus. 

Owing to the present lack of information in regard to this mental 
process, our instructions to the subject are not very definite and this 
causes a certain looseness in the experiments. The method of e.a.i. 
shares this shortcoming with the methods of just perceptible differ- 
ences and of average error. All these procedures are eminently 
suited for work with untrained subjects, because they require no con- 
trol of the attitude of the subject. This is not a fundamental objection 
against these methods, but rather an incitement to describe the atti- 
tude with which the untrained subject approaches his task. There is 
a place for work under laboratory conditions, and there is a place for 
procedures suited for clinical and anthropological work. We may make 
an experiment more precise by defining the attitude of the subject, 
but we restrict at the same time the usefulness of this procedure for 
work on untrained subjects who are not able to control their attitudes. 
It seems best not to strive after too high a precision in experiments 
which after all never can equal the conditions of the method of con- 
stant stimuli in lucidity and precision. 

The task of the subject is difficult, if the number of groups is 
not quite small. Each group represents a certain type of judgment 
and the greater their number the harder it is for the subject to ac- 
quire and maintain his standard of judgment. Let us explain these 
difficulties by reference to other experiments. 

The standard of judgment in the method of e.a.i. is somewhat 
artifical, as is seen by the fact that the subject first needs some in- 
struction and preliminary experiments, while the task of the subject 
in the method of just perceptible differences requires no explanation 
at all. In the method of constant stimuli the subject has to decide 
between three judgments — smaller, equal, greater — and in the 
method of multiple cases between five judgments — much smaller, 
smaller, equal, greater, much greater—, all of which are familiar to 
the subject from everyday experience. In spite of this it is very hard 
to maintain one’s standard of judgment with three judgments only, 
and it takes a very practised subject to handle five judgments com- 


petently. 
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In the method of e.a.i. the subject may be required to sort the 
stimuli into eight or more groups, the series of which forms a some- 
what artificial standard supported by the limited experience of the 
preliminary experiments only. The standard which a subject may 
have acquired for n equidistant groups between A and B, does not 
necessarily apply for n groups between A’ and B’ or for m groups be- 
tween A and B. As a rule additional practice is needed, if the subject 
is to work in a new range or if the number of groups is changed. 
Many investigators have reported on the difficulties for the subject in 
the method of ¢.s. to maintain his standard and always connect the 
same meaning with his judgments. Similar complaints about the 
experiments by the method of e.a.i. are rare, but we may presume that 
here the difficulties are still greater. It seems very much worth while 
to collect data for settling this point, which has an important bear- 
ing on the question of the constancy or variability of the experimental 
conditions. 

We base the analysis of the method of e.a.i. on the principles 
explained in the monograph on The Application of Statistical Methods 
to the Problems of Psychophysics.* We regard the experiments as 
chance events with constant mathematical probabilities. These sort- 
ings certainly have the subjective character of chance events, since 
we cannot foresee the outcome of an individual experiment, and we 
assume that the experimental conditions remain sufficiently constant 
as to warrant the view that we have to deal with constant probabili- 
ties. The observed numbers of relative frequencies are empirical de- 
terminations of these unknown probabilities, which implies that they 
are affected by errors of observation distributed according to the 
probability function. 

The fina] results of a series of experiments by the method of 
e.a.i. may be tabulated in this way: 


Y Yo oe Dk-a Yx Yr S/S 
x Qy1 Ao so0 Oy ker Ak Qs k+1 “s+ Ain 
Lo A, Qe 2% ens Ax, Ae +1 7+ AN 
Lm Qini Ame *9* Am k-1 Omk Om ke s°* Amn 


*Urban, F. M. The application of statistical methods to the problems of 
psychophysics. Philadelphia: Psychological Clinic Press, 1908, pp. ix + 221. 
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where a@;, is the number of experiments in which the stimulus x; was 
assigned to the group k. The sum 

Gia + Gia + +++ -+in = 8; 
gives the number of experiments which were performed with the 
stimulus x. It is advisable and convenient to use all stimuli an equal 
number of times so that 


$1 = Ss — eee =Sm. 
Let us introduce the following notations: 


Wik = Qi, + Gig + -+- + Giza 
= 1,2---,™m 

Vik = Wik ( 

W ik = Wiksr + Wikre +--+ + Win Se ree 
ui. gives the number of experiments in which the stimulus 27; was 
assigned to one of the groups smaller than y,. Taking y, as reference 
we may say that ui is the number of experiments in which 27; was 
judged smaller than y;. In the same way wi; is the number of experi- 
ments in which x; was assigned to one of the groups greater than y; , 
and in this sense w;, is the number of experiments in which x; was 
judged greater than the standard y;. vi is the number of the equality 
cases, i.e., the number of experiments in which x; was sorted into yx. 

The results obtained by the method of e.a.i may be split up into 

a number of tables containing data for treatment by the method of 
constant stimuli with the groups y2, Ys, --- , Yna aS Standards. The 
table for the standard y; may be given as an example: 


smaller equal greater 


Xv, Unk Vik Wik 
Xe Uek Vek Wok 
Lm Umk Umk Wk 
Dividing the numbers of each line by s; , 82, +++, $m we obtain the rela- 


tive frequencies of the judgments smaller, equal, greater which we 
need for calculating the constants of the psychometric functions with 
reference to the standard y,. Similar tables may be obtained for the 


values k = 2,3,-++,n—1. 
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We indicate the constants of the psychometric functions of the 
smaller and greater judgment referring to standard y, by h™,, eM, 
and h,, c®;, respectively. These quantities must be determined 
from the experimental data. The stimulus 2; gives the observation 
equation 

h®),, Xi + ce), a y 2) ix 
with the weight s; P‘);, for the determination of the constants of the 
psychometric function of the “greater’’ judgments with reference to 
the standard y,. The same stimulus x; gives the observation equation 


Shar et vj + CO xy = Vis hea 

with the weight s; P;.";., for the determination of the constants of the 
psychometric function of the “smaller” judgments with reference to 
the standard y;.,. The values y,, Pix, yin, Pix depend on 

Wik = Qiks + Giz + + Qin 
and 

Uiksr = ir + Wie = + ix. 
From this it follows that 


Wi ,k+1 
$j 





Wik _ 
si ile 


Since the relative frequencies add up to unity, we have 
Po, = P),,.. 


and y);, and y;;., must be numerically equal. Since, furthermore, 
the frequencies refer to opposite judgments, we have 
2. = 9): es . 

The two systems of observation equations are identical and we 
see that the constan!s of the psychometric functions of the “greater” 
judgments for the standard y, are equal to the constants of the psy- 
chometric functions of the “smaller” judgments for the standard 
Yx.,. This important proposition reduces the work of evaluating the 
the data by exactly 50%. 

J. P. Guilford publishes the results of a card-sorting experiment 
on the perception of the density of spots, which exactly answers our 
purpose.* The first step consists in finding the relative frequencies 


Uik . " pee ‘ 
—. Entering these numbers in a table of similar construction one 


i 


obtains in each column the data for the determination of the con- 


*Guilford, J. P. Psychometric methods. New York: McGraw Hill, 1986, p. 164. 
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stants of the psychometric function of the smaller judgments for the 
appropriate standard. Writing down the observation equations is 
quick work, since many of the observations have the weight zero. Set- 
ting up the normal equation requires some care, since the stimuli are 
not equidistant and the multiplications must be worked out. The aux- 
iliary table, for the method of constant stimuli, can be used for the 
products yix Pix only. It is best to follow closely in the computation the 
scheme explained in the paper “Die psychophysischen Massmethoden 
als Grundlage empirischer Messungen.”* The following table gives 
the results of this computation. S“; and S‘); are the lower and the 
upper limits of the interval of uncertainty, defined as the stimuli 
which give the probability 0.5 to the judgments smaller and greater. 
&; is the estimation of group i. To each group belongs an estimation 
value which is representative of the whole group, and an interval of 
uncertainty which is inversely proportional to the accuracy with 
which the subject perceives this group. This quantity depends on the 
precision with which the subject handles his mental standard of the 
different groups. 
































Seale| h®,; SQ; he); S@); S@ -S@; g; 
value 
2 | 0.1493 | 13.80 | 0.1723 | 19.30 5.50 16.75 
3 | 0.1723 | 19.30 | 0.1614 | 23.12 3.83 21.15 
4 | 0.1614 | 23.13 | 0.0880 | 29.83 6.70 25.50 
5 | 0.0880 | 29.83 | 0.0882 | 38.46 8.63 34.00 
6 | 0.0882 | 38.46 | 0.0703 | 48.59 10.13 43.13 
7 | 0.0703 | 48.59 | 0.0843 | 62.92 14.33 56.40 
8 | 0.0843 | 62.92 | 0.0596 | 76.00 13.08 68.34 
Owing to the relation S®; = S“,,, intervals of uncertainty for 


successive standards are contiguous. They cover the whole range com- 
pletely and there are no gaps between them. They have a pronounced 
tendency to increase with increasing standards and the values at the 
end of the table are considerably larger than those at the beginning, 
but the regularity of this trend is interrupted in two places. 

The constants h set in with high values and show a sudden break 
for group 4. The new level is maintained up to the last term which is 
considerably smaller than the others. 

These quantities indicate the steepness of the curves representing 
the psychometric functions. The greater the value of h, the steeper 
the corresponding curve. The interval of uncertainty indicates how 


*Urban, F. M. Die psychophysischen Massmethoden als Grundlage empir- 
ischer Messungen, Arch. f. d. ges. Psychol., 1909, 15, 261-415. 
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far the two curves are drawn apart and has the following significance: 
The rectangle with unity as height and the interval of uncertainty as 
base is equal to the area included between the curve for the psycho- 
metric function of the equality judgments and the abscissa. This 
area is proportional to the total number of equality judgments given 
by the subject in the course of the experiments and in this sense indi- 
cates the importance which this particular group has for the subject 
when sorting the stimuli. The significance of the constants h and 
S®,; — S@, has an important bearing on the results under discussion. 

A judgment, the probability of which has no significant figures on 
the first two decimals, has no influence on our data. This depends on 
the number of experiments performed, and in a longer series of ex- 
periments we will have to go beyond this limit. Let U; be the stimu- 
lus for which the psychometric function of the smaller judgments for 
standard i has the value 0.995. All stimuli smaller than U; have prob- 
abilities which for our present purposes may be put equal to zero. In 
the same way we designate by 0; the stimulus for which the psycho- 
metric function of the greater judgments has the value 0.995. All 
greater stimuli have probabilities which for our purposes may be put 
equal to zero. 

Only stimuli belonging to the interval 7 ;T®; may be sorted 
into group 7. Somewhere near the centre lies the estimation value é; 
and around it the corresponding interval of uncertainty S@,S®;. 
Then come to the right and left the intervals of uncertainty for the 
next higher and lower groups, and perhaps the intervals of uncertainty 
of still other groups may belong to T™ ;7; . The length of this inter- 
val depends on the values of the constants of the psychometric func- 
tions for the standard. If h™; and h®); are small, the curves repre- 
senting the psychometric functions are drawn out and the interval 
T™,T®@), will be long. If, furthermore, the intervals of uncertainty 
for standard i and the neighbouring standards are small, T™;,T®; 
will comprise several of these intervals. 

The intervals T;T®, , T%,,T®_,, T%,4.T®;,, partly overlap 
and other intervals may do so too. Let us consider a stimulus x which 
satisfies the inequalities 


TO), ,= 25 T),,. fork = —2,—1,0,1,2 


but no other similar inequalities. This stimulus belongs to all these 
five intervals, which means that there exist definite probabilities, dif- 
ferent from zero, that x will be sorted into one of the groups i—2, 
i—1,i,7+1,7+2. Some of these probabilities may be equal, but as 
a rule they will differ. Their sum must be equal to unity, since the 
subject in carrying out the instructions given by the experimenter 
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must sort the stimulus z into some group, and no other groups come 
into consideration, since all their probabilities are equal to zero. 

The number of groups between which a subject chooses when 
sorting the stimulus x depends on the clearness with which 2 is per- 
ceived, and on the precision of the standard by which the subject 
goes. The greater the number of groups between which the subject 
chooses in sorting  , the less reliable his determination becomes. The 
totality of the groups to which « may be assigned shows a scatter, 
which may be measured by several different methods. The characteri- 
zation of the scatter by the constants of the phychometric functions 
is preeminent, because from these quantities we may reconstruct the 
entire set of results. 

The results of Dr. Guilford’s experiments are given in a table of 
double entry, the first variable being the intensity of the comparison 
stimuli and the second the series of the groups into which the stimuli 
are sorted.* The stimuli are quantities measurable in the ordinary 
sense of the word. The numbers 1, 2, 3, ---, by which we designate 
the groups, are not symbols of quantity but ordinary numbers, and we 
might just as well use the letters A, B, C,--- for designating the 
groups. 

We now assume that these numbers designate quantities and that 
we may use them as any other symbol of quantity. The steps which 
separate the groups are introspectively equal, but apart from this 
there is practically no evidence in support of this supposition. We 
want to see what kind of deductions may be drawn from it and how 
far they are verifiable by experience. The question as to whether the 
series of the groups has quantitative character is of such importance 
that no evidence which might throw some light on it must be neg- 
lected. 

The supposition that the figures y designate ordinary quantities 
enables us to calculate the constants of the psychometric function giv- 
en in the table above by an entirely different method. Our calculations 
until now were independent of the supposition that the series of the 
groups has quantitative character, and it will be interesting to com- 
pare these results with the data obtained under our new supposition. 
The agreement between these results may be taken as support of this 
assumption, although it proves, strictly speaking, just that the pres- 
ent material does not contradict the hypothesis that the series of the 
groups has quantitative character. 


. r Wik Wik saa - 
The relative frequencies ae and —— are empirical determina- 


v “5 


*Guilford, J. P. Psychometric methods. New York: McGraw-Hill, 1936, p. 158. 
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tions of two psychometric surfaces represented by 

Yi (x,y) =4—4 (Ay + But Cy+ Di) = 4—3 Off: (%Y)] 
and y 

Po (%,y) = 4+ 4 ®(Asy + Bor+ Cy+ D2) = 4+ 4 O[2(%Y)). 


The coefficients in these functions are unknown and must be deter- 
mined from the data of observation. We have two separate sets of 
equations, one for the constants of Y, and the other for those of ¥,. 
The same problems arise as in the method of constant stimuli, the 
only difference being that we here have to deal with 4 unknown quan- 
tities. 


The relative frequency a of the smaller judgments for the 


U 


stimulus i with reference to the standard y, gives the observation 
equation 


A, viyr + Bia + Cy. + D, = y™ x with weight P™ j, . 


Owing to wi, = 1 — wWix.,, the corresponding equation in the system 
for the determination of the constants in Y, is 


AUiYn-r ao Bix; + CoYk- oe D, — y(n with weight P44 ’ 
where 
yD ix — y (2) 
P = Ps k1 2 

There exist certain simple relations between the sums which form 
the coefficients in the two sets of normal equations. It is easy to prove 
the following relations by considering the determinants of the two 
systems 

A,=A,, B,=B,+A,, C,=C,, D, = D,+ C, 
from which follows 

f.(x,y) = Aw (y+1) + Bye +C,(y+1) + D, = f,(x,y+1). 

We see that for any standard the upper limit of the interval of uncer- 
tainty is equal to the lower limit of this interval for the next higher 
standard. This proposition follows directly from the equation just 
written down. 

It is sufficient to determine the coefficients of the psychometric 
functions of the smaller judgments. This proposition is very valuable, 
for it reduces the work of calculation by exactly 50%. This is very 
important on account of the great amount of work involved in setting 
up and solving the normal equations. 


i,k-1 








F. M. URBAN 127 


In Guilford’s experiments we have 69 observation equations with 
weights differing from zero involving 4 unknown quantities. It is in- 
dispensable to arrange the work systematically and to provide for 
automatic and thoroughgoing checks. The scheme developed by com- 
puters in the method of least squares does all this, and there is no par- 
ticular merit in adapting this scheme to our particular purpose. Be- 
yond the wish to be of assistance to the workers in this field we con- 
nect with the publication of the scheme an ulterior purpose. The cal- 
culations are entirely too laborious to allow this scheme to be of easy 
and ready use. The method in its present form is not handy enough. 
This was exactly the case of the method of constant stimuli some 30 
or 40 years ago, before the various devices were discovered which 
reduced the necessary labour to such an extent that we may finish in- 
side of 20 minutes the computation on which formerly a whole day 
might have been spent. It is not unlikely that the same may be done 
for the calculations under discussion. 

After bringing the experimental results into the form in which 
they are presented by Guilford, we begin with the calculation of the 


= 3 Ui P = 
relative frequencies —. Choosing 32 as the new zeropoint and call- 


v 


ing a,b,c,d the coefficients of the unknown quantities A,B,C,D, 
we enter the data for the observation equations into the following 
table: 




















| | | YP 
x y| pia b | e¢ y 8 | Ye — 
wen Re rs ee ee 8 = ae et Ee 
—17 | —4 | 0.35 | 51 —17 | —3 0.2725 32.2725 | 0.947 | 0.2581 _ 
—17 | —3 | 0.80 | 34 | —17 | —2 | —0.5951 15.4049 |0/770 — | 0.4579 
—17 | —2 0/98 17 | —17 | —1 | —1.4520 | —1.4520 | 0.188 — | 0.2732 





We need not separate positive from negative items in the first 8 
columns, since we do not need their sums. The values y and P are 
found in the tables for the method of constant stimuli. Since these 
tables must be entered anyhow, it is convenient to tabulate yP too. We 
must provide here for two columns in order to separate the positive 
from the negative terms. 

Remembering that d = 1, we find the products needed for the 
determination of SsP, SaP, SbP, ScP by multiplying the figures 
s,a,b,c¢0n a line by the value of P on the same line. For finding 
these products we need open Crelle’s tables just once. Positive and 
negative values must be kept separate, so that we need a double col- 
umn for each one of these quantities. Retaining three decimals in the 
product we obtain a table of this kind: 
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sP aP bP cP 
ee ae ee ee eco 
30.562 — | 48297 | — | — | 16099 | — | 2841 
11.862 — | 261809 | — | — | 18090 | — | 1.540 
— | 0273 | 3198 | — | — | 3.196 | — | 0.188 


Multiplying the numbers in this table by a we obtain the products 
for forming the sums SasP, SaaP, SabP, SacP, and SayP. Re 
peating this process with the coefficients b and c we obtain the sums 
SbsP , SbbP, SbeP , SbyP, SesP, SccP , ScyP. The check by the 
sums is thoroughgoing. 


Guilford’s data give the following results: 


SaaP = 126,728.732 DSbyP = 49.212 
SabP = 28,859.244 DSbsP = 44,806.112 
SacP = 2,567.708 SccP = = 194.612 

Sap = _=—«1,502.620 dScP = 17.975 
SayP = —37.189 ScyP = —15.265 
DSasP = 159,621.115 ScsP = 4,267.650 
SbbP = 14,222.875 >P = 47.197 
DSbeP = = 1,502.620 SyP= —3s.867 


The sums check up in this way: 
SoP = 172.161 SsP = 1,736.087 
SaaP + SabP + SacP + Sab + SayP= dasP 
DabP + SbbP + SbeP + SOP + DSbyP= SbsP 
SacP + SbeP + SeceP + SceP + SeyP= SesP 
SoP+SbP +SeP +P +3DyP = deP. 


These sums are the coefficients of the normal equations. In solving 
them it is convenient to make use of the fact that the determinant of 
the coefficients is symmetric, so that it is sufficient to write only the 
part above the principal diagonal. It is best to write the data in tabu- 
lar form, discarding entirely the traditional form of equations. 





A | B C D | Check: 
‘fs ZaaP | XabP ZacP zaP ZayP | ZasP 
II. XbbP ZbeP =bP =byP =bsP 
Ill. | SecP | ZeP | ZeyP | ZesP 


IV. | | =P | SyP | =sP 
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The numbers in the last column are equal to the sums of the coefficients 

of a row. To check a row not written in full, one adds the numbers 

to the left as far as possible and from the diagonal on upwards. 
Dividing the figures on the first line we obtain the coefficients of 


the following equation: 


DSabP DSacP SaP _ — SayP 
ct SaaP SaaP * SaaP~ ~—s SaaP 


We now manipulate equations I’, II, III and IV as follows. We 
multiply I’ by the coefficient of B in I, which also is the coefficient of A 
in II, and subtract from II, thereby eliminating A from II. We mul- 
tiply I’ by the coefficient of C in I, which also is the coefficient of A 
in II, and subtract from II, thereby eliminating A from III. We mul- 
tiply I’ by the coefficient of B in I, which also is the coefficient of A 
in IV, and subtract from IV, thereby eliminating A from IV. The re- 
sult of this manipulation is a set of three equations, the determinant 
of which is symmetric, with the unknown quantities B,C, D. Guil- 
ford’s data lead to the following computation: 


A B ; C D 8 


126,728.732 seams 2,567.708 1,502.620 | —37.189 | 159,621.115 | I 
14,222.875, 1,502.620 172.161 49.212} 44,806.112 II 


; SasP 


. YaaP (T’) 


Check 














6,571.954, 584.780 342.183; —8.469| 36,349.640 
194.612 17.975) —15.265| 4,267.651 | III 
52.026 80.444; —0.754| 3,234.165 


47.197| —3.867; 1,736.087 | IV 
17.817; —0.441| 1,892.630 








1 0.22772, 0.02026 0.01186 | —0.00029 1.25955 | I’ 
7,650.921| 917.890 | —170.022 57.681} 8,456.472 | I 
142.586 —12.469| —14.511} 1,033.486 | II 

110.009 —20.395 6.919} 1,014.410 





| 29.380} —3.426; —156.543 | III 
3.778 | —1.281| —187.921 


1 0.1199582 |—0.0222224 | 0.0075391 | 1.1052751 | I” 



































| 32.577 7.926 | —21.430 19.072 | I 

| | 25.602| —2.145 31.378 | II 

| | 1.928; —5.215 4.640 | 

| 1 |—2433005 |-0.657826| 0.585444 | 1’” 

| 23.674 | 3.070) 26.738 | 
D= 0.1297 Check 0.1295 B= 0.09811 


C=—0.6894 A =—0.0091 
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In working out the results by this scheme, nicety in spacing the 
figures and lines is of assistance. Beneath the coefficients of equations 
II, III, IV there must be space for entering the values to be subtracted 
from them. The next steps consist in setting up I’ and entering the 
products in the right places. The results of these subtractions are en- 
tered as coefficients of the second group of equations I, II, III. Fol- 
lowing up this scheme we obtain D and the check assures us of the 
correctness of this value. Working backward we obtain C, B, A from 
I’, I’, I’ by simple computation. 

By means of these constants we may express all the quantities 
which are of interest to us as functions of y. We have 


h(y) = 0.0931 — 0.0091y 


0.6894y — 0.1297 
0.0931 — 0.0091y 


__ 1.8788y + 0.4300 
~ 0.1771 — 0.0182y 


with y = 5 and x = 82 as zeropoints. 

These formulae were used to calculate the quantities which we 
determined directly in the first part of this paper. In the following 
table the columns headed by h’, S’, and &’ give these results for the 
different scale-values, while the data in the columns headed by letters 
without primes refers to the results obtained directly. 





S(y) = 





E(y) 























Scale- | | | | 

value | h, | h’, S, | @, | 38, |e.) *t 4 
2 | 0.1493 | 0.1204 | 13.80 | 13.75 | 5.50 | 4.70 | 16.75 | 16.00 
3 | 0.1723 | 0.1113 | 19.30 | 18.45 | 3.83 | 5.54 | 21.15 | 21.10 
4 | 0.1614 | 0.1022 | 23.13 | 23.99] 6.70 | 6.62 | 25.50 | 27.14 
5 0.0880 | 0.0931 | 29.83 | 30.61 8.63 8.05 34.00 | 34.43 
6 | 0.0882 | 0.0840 | 38.46 | 38.66 | 10.18 | 10.02 | 48.18 | 438.38 
7 0.0703 | 0.0749 | 48.59 48.68 | 14.33 | 12.78 56.40 | 54.66 
8 0.0843 | 0.0658 | 62.92 | 61.46 — — 68.34 | 69.28 


The agreement between calculation and direct observation is poor 
for h , scale-values 2 and 8, since it exceeds 50% of the smaller value. 
In all the other cases the agreement is fair, in some cases even excel- 
lent. It is safe to say that Guilford’s results do not contradict the hy- 
pothesis that the series of scale-values is measurable. In view of the 
far-reaching importance of this statement it does not seem profitable 
to phrase it in a more positive way, although the agreement between 
calculation and direct observation might warrant it. 

There is just one more point of importance to settle: Do the re- 
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sults bear out Weber’s Law and Fechner’s Law? The following little 
table gives the answer. 

















WEBER’S LAW FECHNER’S LAW 
~ Geale- | yi y':é | ft | Sua th 
value 

: ¥§ 0.333 | 0.293 a pam 

3 0.181 0.262 | 27 1.32 

4 0.263 0.240 1.21 1.29 

5 0.254 0.234 1.338 1.27 

6 0.235 | 0.281 1.27 1.26 

7 0.254 | 0.234 1.31 1.26 











Weber’s Law predicts the constancy of the relative threshold, i.e., 
of the quotient of the interval of uncertainty divided by the estima- 
tion value. The first two directly observed values do not conform to 
this regularity, but the other values show very little variability indeed. 

Fechner’s Law predicts the constancy of the ratio of successive 
estimation values. It is surprising how closely the results verify this 
statement as well for the directly observed as for the calculated val- 
ues: For the directly observed values the difference between the larg- 
est and the smallest values is less than 10% of the latter, and for the 
calculated values this difference is less than 6% of the smallest num- 
ber. 

We conclude that Guilford’s experiments agree with Weber’s Law 
as well as with Fechner’s Law, but the agreement is better in the latter 
case. The calculated values conform better with these regularities 
than the directly observed values. 
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NOTE ON FACTOR ANALYSIS* 


EDWIN B. WILSON AND JANE WORCESTER 
Harvard School of Public Health, Boston, Mass. 


Certain assumptions and procedures basic to factor analysis are 
examined from the point of view of the mathematician. It is demon- 
strated that the Hotelling method does not yield meaningful traits, 
and an example from the theory of gas mixtures with convertible 
components is cited as evidence. The justification of current meth- 
ods for determining the adequacy of the reproduction of a correla- 
tion matrix by a factorial matrix is questioned, and a x? criterion, 
practical only for a small matrix, is proposed. By means of a hypo- 
thetical example from geometry, it is shown that results of a Ho- 
telling analysis are necessarily relative to the population at hand. 
The factorial effects of the adjunction of a “total test” to a group 
of tests are considered. Some of the general considerations and ques- 
tions raised are pertinent to types of analysis other than the Ho- 
telling. 


1. Introduction. There have been proposed several kinds of fac- 
tor analysis from the early days of Spearman to the latest days of 
Thurstone, each aiming to do certain somewhat different things, and 
with respect to some kinds of factor analysis there has been suggested 
more than one way to accomplish essentially the same analysis. Now, 
assuming that there are such things as traits or capacities or what 
you will which different individuals possess in different degrees and 
which go to determine the ratings one gets on tests or inventories, it 
is clear that one would hope ultimately to find some fairly direct meth- 
od of assigning to each individual his rating on each such trait, just 
as when specimens of a gas have such physical characteristics 
as pressure, temperature, mass, ..., we undertake to assign the val- 
ues of each of these for a particular specimen by such direct proce- 
dures as the use of a guage, of a thermometer, of a balance, .... 
When, therefore, the psychologist resorts to any complicated indirect 
method of quantitatively determining factors, it is to be presumed 
that he does not have any clear-cut notion as to what it is that he is 
measuring or else that he is following a preliminary procedure of 
measurement because the apparatus for direct measurement has not 
yet been devised — it will not do to consider these alternatives as 
mutually exclusive. 

* The calculations which lie in the background of this Note, but of which 


only relatively few are in evidence, were made possible by a grant from the Car- 
negie Corporation of New York to whom grateful acknowledgment is hereby made. 
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The fundamental concept of factor analysis appears to be that 
the scores of various individuals on any test x are due to a (linear) 
combination of their ratings (if one could but get them) on a variety 
of capacities, f,g,h,---, so that the score x; of the ith individual on 
the test x is 


4, =afitbgitch,+--- 


where a,b,c, ---, are independent of the individual and where (to 
save an additive constant) the quantities x, f,9g,h,---, are meas- 
ured relative to their mean values in a group which is under test.* It 
is further ordinarily assumed that the capacities f,g,h,---, are un- 
correlated} in the group. When a group is subjected to a variety of 
tests x, y, Z, ++: , it is found that the scores on the tests are corre- 
lated with coefficients 7, , 7, Ty, ++ , So that the tests cannot them- 
selves measure degrees of capacity in any lines of the sort designated 
by f,9,h,--:, and considered to be “factors.” 

It is usual to express the scores on the tests in “standard meas- 


ure” so that o?, = o*, = o?, =--- = 1. Inasmuch as the factors f,9,h, 
+++, are apparently considered to be more basic mental characteristics 
than the tests x,y,z, --- , it is to be presumed that if they could be 


measured directly they also would be expressed in standard measure ;t 
at any rate they may always be so expressed simply by changing the 
coefficients a,b,c, --- , and if so expressed we must have a? + b? + ¢c? + 
--- = 1. It is generally assumed that when we have a large number n 
of tests x,y,z, --- , the number k of significant common factors f,g,h, 
-+-, is decidedly smaller. This assumption means that the correlation 
coefficients Tn, Mrz, Ty: , +: , can be reproduced adequately by assign- 


*It is to be noted that the measurements x are ordinarily not relative to 
the means of any group but definite positive numbers like pressure or tempera- 
ture and that to introduce the device of referring them to the mean of a group 
itself leads us away from the concept of a trait as a characteristic of an indi- 
vidual; of course if the traits in which we are interested are those of the group, 
this matter may be quite different. 

+ The assumption of noncorrelation should be noted. It is a mathematical 
assumption imposed in advance of the definition of what factors or traits really 
are and, of course, tremendously constrains the definition. Independence of vari- 
ables as used in the physical sciences is not related to noncorrelation of those 
variables in groups of physical objects. How powerfully the assumption of non- 
correlation impinges upon our notion of what traits are or upon our choice of 
groups which may be submitted to factor analysis may be conjectured from con- 
sidering the statement: No two ratable characteristics of individuals which should 
be found to be correlated in any group which might properly be subjected to a 
factor analysis could both be considered as traits. 

tIn this statement we see again the encroachment of the ideological element 
“group” upon the ideological element “individual.” It is doubtful if there is 
utility, it may be there would be disutility in introducing standard measures of 
such a thing as pressure or temperature or, in the psychological field, of I.Q. or 
any other individual trait. Ordinarily in scientific measurements “zeros” of scales 
and “units” of scales are not defined relative to groups. 
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ing appropriate values to the coefficients a,b,c, --- , by which the n 
tests are linearly expressed in terms of the k factors. 

2. Hotelling analysis would not give back known traits. If now 
we did have, let us say, three tests x,y,z dependent on two factors f,g 


so that 
x=aftbg, y=af+bg9, z=a’f+b’g, 
e4i+ve=1, a? 4.62 =1, a”? 4.52 = 1, 
Tx = aa' + bb’, 7,, = aa” + bb”, r,, = aa" + b’b", 
and if we should apply thereto an analysis of the Hotelling type* 
which makes the “factors” those upon which the greatest (or other 
extremum) variance may be thrown, we should not get back the fac- 
tors f and g. This may be seen by taking a simple numerical case: 
a= 6,b = 8, a’ = .28, b’= .96, a” = .936, b” — .3852. Then the vari- 
ances of the three Hotelling factors are 2.594, .406, and 0. The first 
two are 
f = .6199x + .5678y + .5416z = 1.038f + 1.2329 
9g = .0890a + .6349y — .76752 = —.487f + .411g 
and the last (which would not be obtained) is 
0 = —.780a + .524y + .34382 = Of + 09. 
It is to be remembered that f’ and g’ are not unit vectors,7 i.e., 


* We shall confine our illustrations almost exclusively to the Hotelling type 
of analysis which apparently has been adopted by Kelley (Essential Traits of 
Mental Life), neglecting for the most part both the early Spearman (The Abil- 
ities of Man) and Kelley (Crossroads in the Mind of Man) analyses and the later 
Thurstone configurational analysis(The Vectors of Mind). However, it may be that 
some of the general considerations and points of view of this Note are applicable 
with slight modifications to other types of analyses than the Hotelling — for ex- 
ample, §§ 4-5 should apply to any analysis which aims to reproduce a set of 
correlation coefficients. 

+ An arithmetical word about the factor loadings. By the method of obtain- 
ing these coefficients the sum of the squares of the loadings for each factor is 1, 
it being understood that the factors y obtained by using these loadings are not 
unit vectors but have the variance \. When the analysis is complete, i.e., for as 
many factors as there are tests, the sum of the squares of the loadings for each 
test on all the factors is also 1. Moreover, the products of the corresponding load- 
ings for any two factors is zero and the products, if the analysis is complete, o 
je corresponding loadings of any pair of tests for all factors is also zero. This 

v be stated briefly by saying that the complete set of factor loadings is an 
wae set of coefficients, i.e., such a set as arises in rotating one set of or- 
thegonal axes into another. However, if we use these coefficients to express the 
tests in terms of the factors, the tests being non-orthogonal unit vecters, the fac- 
tors must be considered to include their standard deviations, being thus orthogonal 
non-unit vectors of specific magnitude; and if we use the coefficients to express 
the factors in terms of the tests, the factors again are non-wnit orthogonal vec- 
tors. The use of the orthogonal coefficients is very convenient because the tests 
and factors solve immediately either for the other. There seems to be a confu- 
sion in this matter on p. 60 of Kelley’s Essential Traits of Mental Life. 
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their variances are not 1 but 2.594 and .406 respectively. It is of 
course clear that if f and g are traits with unit standard deviations 
any other units such as 


f.=feosé+gsiné, g,=—fsiné+ gcosé, 


where @ is any angle, may replace them throughout the expressions 
for x,y,z. The question is: Would such traits be useful? Shall we 
consider that any unit vectors whatsoever which arise by a rotation 
from a given set of orthogonal unit vectors may be traits? It is un- 
likely that we should find similar mathematical transformations in 
pressure, temperature, mass, ..., useful in gas theory. If then we 
wish to place confidence in the utility of the Hotelling factors in psy- 
chology it is because of conditions obtaining in psychology and any 
validity to the analysis other than arithmetical must derive from our 
knowledge of that science.* Why should there be any particular sig- 
nificance psychologically to that vector of mind which has the proper- 
ty that the sum of the squares of the projections of a set of unit vec- 
tors (tests) along it be maximum? The significance of the mathe- 
matical ellipsoids introduced in the analytical theories of rigid me- 
chanics or of optics of crystals or in other theories of physics inheres 
in the fact that these mathematical devices follow from the physica! 
properties involved; is the present state of the analytical theory of 
psychology such that any ellipsoids result therefrom? 

3. Hotelling analysis of some data from Gibbs. In a paper on 
“Empiricism and Rationalism,” many of the viewpoints of which may 
be somewhat germane to the present discussicn,s an example was 
worked out upon data from a paper by Willard Gibbs to show what 
would be obtained if the regular methods of partial correlation were 
applied, and how the results would fail to satisfy the rationalistic lean- 
ings of the physicist. It is interesting to treat Gibbs’ group of meas- 

*Thurstone (The Vectors of Mind, p. 181) maintains that the Hotelling unit 
vectors and orthogonal transforms of them are without psychological significance 
unless it be that some particular transform is shown to have such significance. 

+Wilson, E. B., Empiricism and rationalism, Science, 1926, G4, 47-57. The 
detailed study of the simple case of three intercorrelated variables throws enough 
light on some of these questions connected with the Hotelling factor analysis (as 
it does on the Spearman analysis for g) that explicit formulas by which the three 


values of \ may be directly computed are worth while. If @ is that angle less than 
180° for which 


pe PR 
ay’ 22° yz 
cos 6 = ————— where A = 
A3 





ry aL ae +} ve ) 
, 


3 
then 


A, =1+2Acos@/3, >4,=1—A (cos 6/38 — V8 sin 6/3) , 


As = 1—A (cos 6/8 + V3 sin 6/8) . 
The formulas are readily evaluated by logarithms. 
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urements of pressure, density, and temperature for a given mass of 
gas with convertible components (acetic acid) by the Hotelling meth- 
od of factor analysis. The correlation coefficients are rpp = .14, 
tpp = .38, Mor = —.82. The values for 4 are 4, = 1.857, 4. = 1.106, 
j; = .037 showing one large, one medium, and one small variance. 


The factors come out as 
yi = .218P — .657D + .723T, ye = .908P + A07D + .102T , 
v3 = —.362P + .634D + .6837 . 


Here the values of P,D,T are in standard measure relative to their 
means in this group of sets of measurements, and 71, y2, 73 are not 
unit vectors but contain as a multiplier the square roots of the respec- 
tive values of 4. In this form the coefficients are an orthogonal set 


and the solution is immediate as 
P = .218 y, +.908 y. — .862 3, D = —.657 y, + .407 y2-+ .634y;3, 
T = .723 y, + .102 y2 + .683 ys. 


It is difficult to believe that there is any utility to such a solution or 
that the factors found are of any general significance in the theory 
of gas mixtures with convertible components — just as seemed to be 
the case with the regression equations. 

4, Adequacy of reproduction of correlation coefficients. One of 
the factors (y;) in the above analysis had very little variance, viz., 
4, = .037, and would presumably be disregarded by factor analysts. 
If, however, this factor be disregarded the correlation coefficients are 
Y' pp = .15001, 7’pr = .39358, 7’pr = —.84985 to five figures. This may 
seem a fair reproduction of the original correlation coefficients if we 
have regard to the standard errors of the originals, viz., 


Ypp = 144.12, Yer = .882+.11, Tor = —.82+ .04, 


the departures being small fractions (in one case 3/4) of the standard 
errors. This, however, is an entirely erroneous method of judging 
whether the reproduction is satisfactory because the three correlation 
coefficients are themselves correlated so that their variations are not 
independent. It is well known that when variables are correlated, 
changes in them by considerable amounts may not be significant when 
those changes are in certain ratios, whereas changes of tolerably small 
amounts which are in certain other ratios may be highly significant.* 

* This point seems often to be omitted in statistical treatments although it is 
of course implied in some of the advanced discussions of least squares and of cor- 
relation. A detailed illustration of what the point may mean for particular cases 


may be found in Talbot, F, B., Wilson, E. B. and Worcester, Jane, Basal meta- 
bolism of girls, Amer. J. Diseases Children, 1987, 53, pt. 2, 275-347 with especial 


reference to §§14-17. 
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With the sample as small as it is here (65), though not many 
times smaller than some samples used in factor analysis of mental 
traits, and with a correlation coefficient as high numerically as 
Tpr = —.82, it is far from certain that the usual formulas for the cor- 
relation coefficients between correlation coefficients are sufficiently 
sound or that variations about the mean values may be considered as 
nearly enough normal, — yet these assumptions will be made, and if 
one is to avoid all sorts of complexities, must be made, in discussing 
these variations that seem so small as to be quite insignificant. Let 


the deviations be 
Ypp— 14= 41, Mpp—.38=%2, Mypr+ 82=2;, 
with o, = .12, «. = .11, «; = .04. Then their intercorrelations are 


aa —.82 eas — .09 >. ta 18. 


It will be noticed that the correlations between the large coefficient 
—.82 and the two small ones (.14, .88) are both small but that the 
correlation between the two small coefficients is large and negative. 
The expression for the quadratic form which occurs in the exponent 
of e in the normal distribution for variations x,, x,, x, from their 
mean values 0 is 


—}[3761x", + 4317%?, + 1128127; — 13504%7.2, 
— 126582,2, 4+- 7973%,2%2] 


and that which appears within the brackets corresponds to the value 
of 7’ for those variations. Now if we substitute the particular values 
of 7,, 2, 3, namely, x, = .01001, x. = .01358, x, = —.02985, aris- 
ing in the comparison of rv’ and r from which we started, we find 
xy? = 21.6, which, for three degrees of freedom, gives a probability of 
well under .01 — i.e., the fit is too bad to be accepted, instead of ap- 
pearing as tolerably good when the variations are compared with the 
sampling errors. On the other hand, had the variations happened to 
be + .1135, + .05166, + .1000, all signs being alike, in place of .01001, 
.01358, —.02985 the value of 7? would have been about 6, with the 
probability more than .05, although one of the three variations is 24 
times its standard error and another one nearly equal to it. 

In this discussion we have computed the new correlation coeffici- 
ents 7’, obtained by ignoring the third factor, by scaling up the load- 
ing coefficients of the two remaining factors so as to leave the values 
of the scores on the three tests still in standard measure. It may well 
be that psychologists would not do that. If we should simply ignore 
the third factor and compute r’ from the loading coefficients of the 
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other two factors, thus no longer having standard scores for the tests, 
we should have 7’ pp = .14852, r’pr = .38920, 7’ pr = —.83612 with devi- 
ations .00852, .00920, —.01612 which would give 7? = 7.94 and a 
probability of less than .05. Whether these values of 7’ may be treated 
as correlation coefficients in this chi-square argument seems somewhat 
doubtful; they are actually mean products of somewhat modified non- 
standardized scores and are, of course, numerically less, though not 
much less than the corresponding correlation coefficients. 

5. What should be the test for adequacy of reproduction of cor- 
relation coefficients? While speaking of ignoring one factor out of 
three and considering that the correlation coefficients and the scores 
may be adequately reproduced by two factors, it may be worth while 
to point out that with the aid of the quadratic expression for 7? rela- 
tive to the stated values of r as means we may raise (and with con- 
siderable arithmetic, of which the detail is here suppressed, may solve) 
the problem of finding what two factors, assuming from the start that 
there shall be but two, will best reproduce the correlation coefficients 
+. We have to consider the determinant of the modified coefficients, 


viz., 


1 14+ 2, 38 + H, 
38 + X —.82 + x; 1 


as equal to zero, because this is the condition for resolution into two 
factors, and minimize 7? subject to this condition. The results turn 
out to be 7’pp = .02258, r’pr = .30760, 7px = —.94480, fairly well re- 
moved from the original values of 7, and 7? = 10.3, which gives a 
value of P between .02 and .01, but of course considerably higher than 
that for 7? = 21.6 found when the factors were kept collinear (as vec- 
tors) with the two most important ones in the solution for three 
factors. 

When offering these suggestions relative to methods of testing 
whether a bank of correlation coefficients is satisfactorily reproduced 
by another bank it should be observed that as the number of tests in- 
creases the difficulty of the method increases perhaps as the square 
of the number of tests. Thus Thurstone for 15 tests gives the table 
of deviations of the correlation coefficients (mean products?) when 
computed from five factors of which one is disregarded.* There are 
105 correlation coefficients between the original 15 tests each with 


* Thurstone, L. L., loc. cit., 118-116. The mean value of the 105 correlation 
coefficients is .816; the largest is .641 and five others are above .5; whether a sim- 
ple test based on standard deviations without reference to the effects of correla- 
tion is valid may be doubted. 
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its own standard deviation and each of the 5460 pairs with a correla- 
tion coefficient. The expression for y? would therefore contain 5565 
terms and would be entirely unmanageable unless some approximate 
method of treatment could be devised; yet with correlations between 
the correlations it is not easy to prove, though it may be easy to state, 
of the deviations of one bank of correlation coefficients from another 
“that the residuals are so small that they can be ignored.” Of course, 
it is not to be overlooked that the y? criterion as applied here may not 
be an appropriate statistical (probability) device to determine when 
residuals are so small that they can be ignored; the question then 
would be: What is the appropriate criterion? 

As science has to do with the discovery of uniformities it is to be 
presumed that no probability device can be regarded as statistically 
sound until one has advanced far enough into a field to get some feel- 
ing for what are the uniformities and further for what are the toler- 
ances with respect to variations. Thus it is not asserted that the above 
test is appropriate, but only a question is raised. Considerable pro- 
gress with respect to feeling out the uniformities or the lack thereof 
in an example of factor analysis has recently been made by Lorge and 
Morrison ;* if we had enough similar studies in this complex field we 
should begin to get that sort of light on the real sampling process in- 
volved, and on the variations due to it, which might go far to deter- 
mine proper criteria for the satisfactory reproduction of correlation 
coefficients and for satisfactory determination of the significance of 
factor analysis itself as a determiner of meaningful traits. 

6. Hotelling and Spearman analyses of a set of three tests. For 
the present note it remains to discuss a few other mathematical mat- 
ters which may throw some light on what the mathematics indicates 
in the absence of adequate study of the psychological meaning. Con- 
sider three variables (test scores) determined by three factors a,b,c 
of unit standard deviations: 


r= Vlat+V2b+V7%e, y= V2a4+ V5b4+ V3c, 
2=V4a+ 7.164 V5e; 
Toy = 9159, 1, = 9330, 7,. = .8937. 
The Hotelling analysis shows one large and two small factors: 

* Lorge, I. and Morrison, N., The reliability of principal components. Sci- 
ence, 1938, 8%, 491-2. The authors compare two batteries of 5 tests each which 
we understand them to consider to be psychologically equivalent. From the names 
of the individual tests in the two series one is tempted to believe that the tests 


are considered to be equivalent individually in pairs. If this be so one may make 
up a large number of equivalent batteries and apply a similar analysis to each. 
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1 = 2.8286, .1085, .0630, 
y, = 58162 + .5734y + .5770z = .8053a + .8480b + 1.2087 , 
vy, = —.1791a + .7822y — .5967z = —.0842a + .2843b — .1434c , 
ys = —.79352 + .2437y + .5577z = .2108a — .0062b — .136 1c , 


where the y’s are not unit vectors but carry the standard deviation 
which is the square root of the corresponding variance 4. As in the 
earlier two-factor example, if the original factors were traits of some 
interest to the psychologist, it is difficult to see how the y’s which are 
special to the particular group should be. 

In this case we can, by introducing the appropriate communal- 
ities,* reduce to a single factor. The communalities which enter the 


main diagonal must be 


h?, = 9562, h*, = .8773, h?, = .9105 ; 


then 
x= 9TT8y, y= .9386Ty, z= .9542y, 


where y is a unit factor. This form makes the resultant x,y, and z not 
standard unit scores, a part of x,y, and z having been rejected as not 
common to the problem; i.e., there are to be added to x,y,z certain ele- 
ments ¢, 7 , ¢ which are random with respect to y and to one another, 
with 

o’- = .0488 ’ o*» = .1227 ’ o*, = .0895. 


The correlation coefficients are reproduced exactly. This is of course 
similar to Spearman’s analysis for g as} 


t = 56212 + .1929y + 26942. 


If we revert to the case taken from the work of Willard Gibbs on 
acetic acid we find that there are no communalities that will reduce 


* As used by Thurstone, loc. cit., p. 86. 

} This value of t has o?, = 1 and is in reality that team test of unit variance 
which correlates most highly with g , part of which is in reality unknown. This t 
is identical with the unit vector y above. For g we can only write g = 1,,t + €, 
where ¢ is unknown. The formula for ¢ is 

x cotacsca + y cot B ese B + z cot y csc Y 


c= ? 
(cot?a + cot?B + cot?y)#(1 + cot?a + cot?B + cot?y)? 


T moe \* 
where cos a = ( =e. 1, te; 
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the rank of the matrix to 1 and lead to a single factor; there is the 
possibility in an infinite number of ways of reducing the rank to 2 
and the analysis to two factors. 

7. Hypothetical factor analyses from geometry. Let us next turn 
to a hypothetical example. We do know from our acquaintance with 
geometry that if a rectangular parallelopiped, which we may call a 
brick, has length 1, breadth b, and thickness ¢ its volume is lbt, its 
total surface is 2(Jb-+ bt-+ It) and the total edge is 4(/-+- b+ f). 
It may be reasonable to think of 1,b,t as physical traits of bricks in 
terms of which more complex characteristics (test scores) may be 
relatively simply albeit not linearly expressed. Now let us suppose 
for the example that we did not know these traits (dimensions) nor 
their relation to the characteristics volume, surface and total edge, 
but that we could measure the volume, surface, and total edge, — a 
situation that may possibly actually have arisen before the relations 
of length, breadth and thickness to all these characteristics were dis- 
covered. What should we learn concerning the underlying physical 
traits of the bricks from a factor analysis upon measures of volume 
V , surface S and total edge FE of a group of bricks of not too dissimi- 
lar appearance? Of course the regressions are not strictly linear but 
that might never be discovered and under reasonable assumptions as 
to the limited variability of the appearance of the bricks would in 
fact not be likely to be discovered. We shall neglect all “terms of 
higher order,” as the mathematician says. 

If a,b,c be the mean length, breadth and thickness (unknown 
traits), the values of v,s,e referred to their means (infinitesimals of 
higher order being neglected) are 


v=bertacytabz, e=4(x4+y4+2), 
s=2(b+c)x+2(a+c)y+2(a-+ b)z, 


where w,y,z are the variations of the traits about their respective 
means, and will be assumed to be random variations in standard meas- 
ure.* These simple values of the variations of V,S,E about their 
means would not however be used, and could not be, being unknown. 
We should standardize the values v,s,e and compute the correlations 
between them. In terms of the (unknown) trait variations we should 
have 


* This assumption of noncorrelation between the variations ,y,z for the dif- 
ferent bricks of a group is of course a gratuitous assumption which might or 
might not be true of any set of bricks depending on the genetics of their manu- 
facture. A similar analysis could be given for any specified values of r,,, 7,. 5 Tyz+ 
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te bex + acy + abz 
V bc? + atc? + a*b? 
Spr be(b-+c) + ac(a+c) + ab(a+b) ai 
be? | arc? | a®b? V (b-Fe)? + (afc)? + (eb)? 
We can now, having thrown away our actual measurements of vol- 
ume, surface and total edge in definite units on a definite scale, per- 


form a Hotelling analysis. Of course everything would be in num- 


bers, not in algebraic symbols as above. 
Let us assume that the (unknown) mean dimensions of the bricks 


were 12; 6; 1. Then we should find on v’, s’, e’ 
Vygr = 8762, 7,,,,= 7095, 7,,,, = 9424 


v's! 


etc 














and we should have that 
v' = .5541 y, + .7648 y. + .38287 ys, 8’ = .6059 y, — .0998 2 
e’ = .5708 y, — .6365 y. + .5187 »;, 

where y, , 72, 73 are the factors not as unit vectors but carrying their 
standard deviation. These could be solved for the y’s in terms of v’, 
s’, e’ if desired and immediately by reversing the matrix of the (or- 
thogonal) coefficients. The expressions of the y’s in terms of x,y,z are, 
in this example 


v1 = 55714” + .7587y + 1.34282, yo = —.3348x — .2979y + .3072z, 
ys = .0891x% — .0874y + .0124z. 





7892 ys , 


The orthogonal system of the y’s reduced to unit vectors can be rotat- 
ed in the orthogonal system of x,y,z (with a possible reversal of one 
direction) but whether one would ever find that rotation and thus re- 
constitute the trait variations x,y,z may be doubted.* The method 
might be of some heuristic value but as soon as we had found our 
geometry we should surely junk it. 

One reason we should junk it is that such an analysis is neces- 
sarily relative to the group. Had we had bricks of mean dimensions 
4:2:1 (approximately the shape of building brick instead of insulat- 
ing brick) we should find 


_ *If there had been correlations between x,y,z, no rotation of the y’s could 
give the x’s because the latter would not be an orthogonal set. It may be remarked 
that no Spearman analysis (for one general) can be given to these two examples 
because, in each, one of the partial correlations, . , is negative. 

> e’.s’ 
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r,., = 9650, 7,,,=.8819, 7, = .9661 


v’ = .5716 y, + .7094 ». + .4123 y;, s’ = .5885 y, — .0043 ». — .8085 »,, 
e’ = .5718 y, — .7048 y. + .4199 y; 

and 

vi = .66592 + .93138y +1.2511z, ye. = —.25386% — .0998y +- .2093z, 


v3 = .04252 — .0607y + .02262. 
It will be observed that the relations of the y’s to x,y,z are different in 
the two cases so that different rotations* would be needed to bring 
the y’s into the system x,y,z, and we still have to worry about a,b,c, and 
l,b,t, and their relations to V,S,Z — it would be a long and tedious 
inductive system back to Euclid, from statistical empiricism back to 
analytical realism. 

8. Adjunction of the “total test.” We may turn next to an- 
other consideration. Some investigators} after making a Hotelling 
analysis of “children’s annoyances,” remake the analysis with the 
addition of the “total annoyance inventory” to the original group 
of annoyance inventories and find, as they anticipated, that the load- 
ing coefficients of the leading factors are unchanged. This observa- 
tion may deserve a mathematical comment. If that system of vectors 
which Wilson suggested and which Thurstone has adopted, at least in 
some of his work, be used here, we may write the matrix with which 
we are concerned as 


@p — UX, ++ U2X2 + X3%3 ++ 


where 2, , 7, %;, +: are the unit vectors representing the tests and 
the dyadic notation of Gibbs is used for a matrix.t This matrix may 
be reduced to 


D = ZU, + AgUoUlg + +++ L AU Uy 


* See footnote on p. 1438. 

+ Carter, H. D., Conrad, H. S. and Jones, M. C. A multiple factor analysis 
of children’s annoyances. Jour. Gen. Psychol., 1935, 4°97, 282-298. 

tIf the test « has the n ratings x,, x,,---, x, for the individuals of a 


group, the expression xx in Gibbs’ notation is the matrix 








©, &,%, ©, 
%o%, %,% —— VoXy, 
UX, UyX LyX 


A discussion of dyadics following the Gibbs’ system may be found for 3 dimen- 
sions in Gibbs-Wilson, Vector Analysis, and for any number of dimensions in Wili- 
son, E. B., On the theory of double products and strains in hyperspace., Trans. 
Conn. Acad. Sci., 1908, 14, 1-57. 
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where ,4A:,A;,°:: are the values of the variances obtained from the 
determ.nantal equation in 4. Now if we add a “total test” y which is 
a linear combination of the x’s as 


Y = (WX, + WX, + W3%3,-+--+)/C 
with C? = w?, + w, + wg +--+ 2wyweti, + 2wWiWstis + +++ , 80 
that y shall be a unit vector, the new matrix becomes 
D = DA yy = AU, H+ Agee ++ + YY. 


As y is a linear function of the x’s no increase of dimensionality is in- 
volved ; we simply get a zero root for 4’ from ®’ plus the same number 
of roots we had before — but the roots will in general be different 
and so will the axes or factors u so that 


@' a Vy wy, + 1'u'U's + vee Vw yw, , 


There are particular cases in which the factors are not changed, name- 
ly, those in which the new “total test” y happens itself to be one of the 
factors u; which themselves are linear functions of the #’s. For ex- 
ample, if y = u, we have 


D’ = (A, +L) Uy + ApUote + +++ AKURUy « 


In this case the loading coefficient for y is 1 in the first factor and 0 
in all others and the loading coefficients of x7, , 7. , %;,--- in all factors 
are just what they were before. It is of course probable that any “to- 
tal test” y which a psychologist would form would be nearly coinci- 
dent with the first factor in his factor analysis. It is however prac- 
tically certain that y would not be identical with u, — if only for the 
reason that the psychologist would probably combine 2,, 22, 23 +> 
with rational coefficients whereas the expression of u, in terms of the 
x’s uses coefficients which are irrational, being “algebraic numbers” 
resulting from the solution of the determinantal equation of degree k . 

When therefore we adjoin the “total test” and find that the fac- 
tor loadings are practically unchanged it presumably means nothing 
with respect to the reality of the factors but only that the “total test” 
is such a linear combination of the original tests as to be practically 
identical with one of the factors, presumably, in a practical case, that 
of greatest variance. However, it is not at all necessary mathemati- 
cally that a simple “total test” be at all like that principal factor. For 
example, if we take the coefficients .14, .38, —.82 used above (§3) in 
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the discussion of the data on pressure, temperature, and density and 
use the simple sum P + T + D, which is the most natural combina- 
tion, we find that, reduced to a unit vector, it gives 


.245 y', + .963 y’2-+ .119 y’;, 


where 7’, , 7'2, 7's, are unit vectors ( not the y:, y2, ys above, which 
carry the standard deviations as multipliers). It will be noted that 
this unit vector lies near to the second factor (A. = 1.106), not near 
to the first factor (4, = 1.857). In many cases however, the sum is 
very closely in line with the principal factor. Thus if we should take 
the coefficients .9159, .9330, .8937 used above (§6), we should find 
that the unit vector corresponding to « + y + z would have compo- 
nents along the directions of 72, y; of only about .0007 each. It should 
be observed that the Spearman team test t, which was obtained in 
this case but which could not be obtained in that of P,D,T because of 
the one negative correlation coefficient, is at least in appearance quite 
different from the unit vector .3433 (a + y+ z) although the angle 
between the two is only about 4° and the component of either unit 
vector perpendicular to the other can be only about .075. 

It is further interesting to observe what happens if instead of 
adding the “total test” and redoing the Hotelling analysis, one should 
merely add (whether on purpose or through inadvertence) one of the 
original tests. This of course could not alter any real factors, traits 
or anything else on which the tests might depend. The general theory 
above indicates that the Hotelling factors in the analysis will be dif- 
ferent. It is instructive to take a simple case. If we use Ty, = .9159, 
Ts, = .9330, ry. .8937 as above and adjoin z’ = z, the correlation 
matrix becomes 


1—A .9159 .9330 .9330 
9159 1—A_ .8937 .8937 
.9330 .8937 1—A_ 1.0000 
9330 .8937 1.0000 1—A 


The four values of 4 are 4, = 3.7854, 4, = .14384, a4, = .0712, 4, = 0 
instead of 2.8286, .1085, .0630. The factor analysis gives for the 
three factors the following loadings — the earlier loadings being re- 
peated for comparison—: 
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I II III IV I II III 
x 4995 1645 —.8505 0 5816 —.1791 —.7935 
y 4887 7571 4335 0 5784 -7822 .2437 


z 5057 —.4471 .2106 -7071 5770 + —.5967 5577 
2’ 5057 —.4471 2106 —.7071 


It will be noted that in terms of the tests the principal factor is now 
(as a unit vector) 


I = 25672 + .2511y + .25992 + .25992’ 


instead of 
I = .34582 + .3409y + .34302z. 


Inasmuch as z and 2’ are identical by hypothesis the last two terms of 
the new I may be combined for purposes of comparison into .5198z. 
It is very clear that the new factor J differs from the old factor J in 
respect to its expression in terms of x,y,z. Of course, a rotation will 
take the new I, II, III into the old J, IJ, III, but the point we are mak- 
ing is that the factors as obtained by the Hotelling analysis are differ- 
ent in the two cases, and since any real factors must be the same this 
illustration alone would seem sufficient to indicate that the analysis it- 
self cannot give the real factors.* 





Spearman’s original observation that psychological tests, as dis- 
tinguished from anthropological measurements, had small tetrad dif- 
ferences and that this could be interpreted as signifying that the tests 
had a common factor was important. The extension to several com- 
mon factors was natural. The inevitable indeterminacy of the sys- 
tem of analysis need not be a fatal handicap, the only serious question 
being whether in fact the analysis does contribute to an understand- 
ing of mental traits.+ The Hotelling analysis has the arithmetic ad- 
vantages of determinacy and of ready mechanical applicability. These 


*If one should desire to add a “total test’ to the system of three tests on 
which one had performed a Spearman analysis (§6) one would find that the ne- 
cessity for the vanishing of the tetrads limited the total tests to being precisely 
the team test t which had already been found and that the team test t’ for the 
system of four tests (three original plus t) would be identical with t but that g 
would be changed in that its known part would be increased and its unknown part 
diminished, which does not make sense inasmuch as the team test t contains no 
new information. 

+ G. H. Thomson has run a long series of penetrating comments on factor 
analysis from at least as early as 1916 to the present time. No attempt will be 
made to cite from the series; the student of factor analysis should be familiar 
with them. We may mention also Tryon, R. C., The theory of psychological com- 
nae ry — an alternative to “mathematical factors,’ Psychol. Rev., 1935, 42, 
t20-404, 
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advantages may bring in with them very real disadvantages. There 
is perhaps nothing more likely to convince one that he has something 
of value than the ability to execute a mechanical arithmetical proce- 
dure. For that reason such considerations as have been offered in this 
Note will not be without usefulness if they serve once more to direct 
attention to the need of examining a formal solution for its psycho- 
logical substance. It would be a pity if Wundt had taken Psychology 
away from her mother Philosophy and married her to Science only to 
have her desert to a paramour Mathematics. 





ere 
ing 
ce- 
his 
act 


to 





psYCHOMETRIKA—VOL. 4, NO. 2 
JUNE, 1939 


DETERMINING A SIMPLE STRUCTURE 
WHEN LOADINGS FOR CERTAIN TESTS ARE KNOWN 


CHARLES I. MOSIER 
University of Florida 


A rigorous and an approximate solution are found for the prob- 
lem: Given a primary trait matrix for n tests and 7, traits, and a 


matrix for the same n tests and r, reference axes, to discover the 


transformation which will transform the second matrix into the 
first, or primary trait matrix. Formulas for determining the limits 
of the effect of using the approximate solution are presented. The 
method is applied to a set of twenty hypothetical tests, defined by 
their loadings on four orthogonal primary traits. After factoring the 
inter-correlations of these variables by Thursto”.e’s centroid method, 
approximating the diagonals, the original hypothetical matrix is re- 
produced with a root mean square discrepancy of .014 by assum- 
ing as known the primary trait loadings of only the first eight tests. 
The method is applied to the results of factoring two batteries of 
14 tests, having 8 tests in common, to give the factor loadings of the 
two batteries on the same reference axes. The method provides a 
means of comparing directly and quantitatively the results of two 
different factor studies, provided they have tests in common, and of 
testing the stability of simple structure under changes in the bat- 
tery. The relations of the method here developed to certain problems 
in multiple correlation are shown. 


In factorial analysis the problem of transformation from an ar- 
bitrary set of reference axes to a set of primary axes which define 
simple structure (5) assumes an important role. Methods have al- 
ready been developed for the case where the relation of the tests to 
the primary axes (traits) is wholly unknown. It sometimes happens, 
however, that the relations of the tests — or some of them — to the 
primary traits are known, either by hypothesis, as in working through 
fictitious problems to test factorial methods, or from the results of 
previous analyses in which these tests were involved. As factor stud- 
ies turn from the preliminary exploratory work to the investigation 
of particular primary traits, the second situation, i.e., where the pri- 
mary trait loadings of certain tests are known, at least approximate- 
ly, will occur more frequently. If two investigators, moreover, factor 
two test batteries with certain tests in common, it is necessary to 
have some analytical method of defining the same trait configuration 
for the traits common to the two investigations. 

It becomes desirable, then, to have a solution to the problem: 
Given a primary trait matrix for n tests and r, traits, and a matrix 
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for the same n tests and r, arbitrary reference axes, to discover the 
transformation which will carry over the second matrix into the first, 
or primary trait, matrix. The problem is developed for the special 
case where the loadings of all tests are known, and then extended to 
the more general case of (n + k) tests, where the primary trait load- 
ings of only the x tests are known. 

Since there will generally be more tests than traits (n >71,), 
some way of combining observations must be found. The method pro- 
posed yields a least squares solution of the problem. In the develop- 
ment of the method it is assumed that the two matrices referred to 
above are of the same order and rank. Later the effect of their being 
of different rank is investigated. 

Let there be: 

m tests whose general subscript is 7 ; whose primary trait 
loadings are known ; 

k tests whose general subscript is 7 ; whose primary trait 
loadings are unknown ; 


7, primary traits whose general subscript is p ; 
rT. reference axes whose general subscripts are m, M. 


Both 7, and 7, are less than n ; in the development of the problem it is 
assumed for convenience that 7, is equal to7r.. 
Let: 
Vjp be the projection of the j-th test on the p-th primary trait 
vector A, , and let V be the matrix of Vj, ; 
ajm be the projection of the j-th test on the m-th reference 
vector, and let A be the matrix of Qjm ; 
Amp be the projection of the p-th primary trait vector on the 
m-th reference vector and let A be the matrix of 


. 
The problem may now be formulated: Given V and A to find A, 
such that: 


D> (Vip — J Am Amp)? = >> p?}p = AMinimum , (1) 
Dp i] 


m p 7 


subject to the condition that: 


fv 

> Atup = 1; (2) 
since each vector, A, , should be a unit vector. Since no constraint is 
put upon the angular inter-relations of the vectors A, , we may, instead 
of solving simultaneously for every value of An, so as to minimize 


> > p’;p» simplify the problem by considering each column of A sepa- 
p>: 3 
rately and minimizing for a fixed p-th column the function: 





t, 
a] 
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U(Amp) => p’ ip (j= 1, 2,--°,7 ; p fixed). (3) 
d 
We may define the discrepancies, or residuals, by: 
Pip = Vip — LV AUjmhmp > (4) 
Then: 
Pip = V7 jp — 2 Vip > Bjmdmp + (S Qjmdmp)? (5) 
and: 


n n n i n Tr 
U(Amp) => pip = > V7 5p — 2D Vip S Umdmp +3 (S Gmdmp)* (6) 
j j j 


m ] m 


It is the function u(4,,») which it is desired to minimize for each 
fixed value of p , subject to the conditional equation (2) 


ys Amp —1l=w (amp) = 0. (2) 


m 


Differentiating (6) with respect to i,,, leads to: 
0U/OAmp = — 2D Vjphjim +2 E J AjmAujdup » (7) 
j j M 


giving 7? equations for the r values of m and the r values of p. Simi- 
larly, differentiating (2) with respect to 1,,, leads to: 


OW/OAmp = Zdmp = 2A. (8) 
Writing (7) in matrix form, 
0U/OAmp = —2A'V+2A'A A. (9) 


By the method of Lagrange’s undetermined multipliers, if 8 be 
the Lagrange multiplier, the matrix A is the solution of the equation: 


0U/OAmp + BOW/OAmp, = 90. (10) 
Substituting from (8) and (9), equation (10) becomes 
—2A'Vt2A'AA—2fBi=0. (11) 
Dividing by 2 and rearranging, we have: 
(V’A+ BI)A=A'V, (12) 


where J is the r * r identity matrix defined on the ranges m, M. 
Equations (12), together with the conditional equation (2), de- 
fine the required values of 4,,, for each p. Equation (6) defines 
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U(Amp) giving the summed squared discrepancies between the original 
—- V and the results of the transformation AA for each column, 


Dp. 3 U(Amp) is the distance between the matrices V and AA (2). 


Binee equations (12) are not homogeneous, their solution is ex- 
cessively laborious, though possible, and the following considerations 
justify an approximation. Since 4m, from equations (12) depends on 
Vjp and a; , and these, in the factorial problem, both represent projec- 
tions on unit vectors, no great error, will, generally speaking, be in- 
troduced by ignoring the conditional equation (2) and assuming for 
each fixed p that the r values of 1, are independent variables. Thus, 
by assuming f equal to zero, the partial derivative of u(Amp) may be 
equated to zero and solved directly for the values of 4m, which make 
U(Amp) a minimum. These values may then be normalized for each 
column p of the resulting matrix A, to give a second matrix, Ay. 

If we set equation (9) equal to zero and solve explicitly for A, 
we have: 


A= (A‘’A)*A’'V, (13) 


which is possible since the matrix A’A is non-singular by Sylvester’s 
Law of Nullity, (1, p. 78), and so has an inverse. This inverse may be 
readily calculated by Aitken’s method,* described by Thomson (4). 
If the resulting matrix A be normalized by columns to give the matrix 
Ao, Ao will differ only slightly from the matrix obtained by the far 
more laborious solution of equations (12) and (2). 

Furthermore, the upper and lower limits of the effect of this ap- 
proximation may readily be calculated. If we designate 


u(i4) as the summed squared discrepancy based on the un- 
normalized xalues of 4 from equation (13), 

u(f, 4) as the value of the function u based on the values of 
A obtained from the rigorous solution of equations 
(12) and (2), 

u(/’) as the value of the function wu based on the normalized 
values of the 4’s from equation (13), 


then: 
u(A) <u(p,s4) <u’), (14) 


since u(/) is the unconstrained minimum value of u, and therefore 
less than the constrained minimum, u(f, 4). But u(f, 4) is the mini- 
mum value of u when the conditional equation (2) is fulfilled, and 


*The writer has recently learned of a more efficient method of computing an 
inverse. See Tucker, L. R. A method for finding the inverse of a matrix, 
Psychometrika, 1988, 3, 189-197. 
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hence is less than any other value of wu for values of Amp which also 


satisfy equation (2). 
The writer is indebted to Mr. Willis C. Schaeffer of the University 


of Chicago for the following’ analytical justification for neglecting f. 
From equations (10) we have: 


0u/dA + Bdow/oA = 0. (15) 
0u/dA = Bdw/od. (16) 


Substituting the partial derivatives from equations (7) and (8), and 
dividing by 2 gives: 


n n 
ss @'mj Vip t+ S Dd Ajm Aju Aup = B Amp + (17) 
7 jij M 
Multiplying (17) by Amp and summing over m gives: 


n tT n f r r 
—% Vin Z Gim dp + 2D Git Ay Qjm Amp = BS Amp - (18) 
b ™m z m m 


But by (2) 
> Amp = 1; 
and hence 
3 ip Sym dno + BE yu day E Am dnp = B (19) 
But 
> Qjm Amp = > Aju Aup = Vip» (20) 


which is the predicted value of v;,. Substituting from equation (20) 
into equation (19): 


p> (4? jp — jp Vip) = B- (21) 
} 
But from equation (4) 
Qjp = Vip— Pip (22) 
and 
B= & (0% —2inp wt pin — p+ Yin Pi) (23) 
or 


n n 
’* (pip —V jp piv) = ~ Pir (pip — Vip). (24) 
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Hence, when p;, approaches zero, 8 approaches zero and the ap- 
proximation is justified. 

In the derivation thus far it has been assumed that r, = 7,. It 
will be useful to investigate what occurs when that condition is not 


fulfilled. If r. > r,, then A, may be determined directly, but it will — 


be of order r. *« r,, and the remaining (7. — 7,) columns of A, may 
then be taken arbitrarily, preferably orthogonal to the 7, column vec- 
tors already determined and to one another. These (7% —v71,) arbi- 
trary axes may then be rotated for the primary traits which they de- 
fine. If the rotation of these additional traits affects the 7, traits, this 
will yield a more precise definition of those latter traits as well. 

If r. < 7,, the composite matrix A’A will be of rank and order 
r,, but the matrix A’V will be of the order r. * 7,, rank 7,. It will 
be possible to determine 7, vectors, A,, of which only 7, are linearly 
independent. It is then possible to select the 7, vectors which best 
transform the columns of A into r. of the 7, columns of V by examina- 
tion of the values of the criterion function, u(A’), for each of the 7; 
vectors, A,. This case is of little interest in the factorial situation and 
is included merely for completeness. 

Thus far the discussion has involved the special case where the 
primary trait loadings of all the tests are-known. Such a case is useful 
only when an hypothetical trait matrix has been set up, the correla- 
tions computed, and the resulting correlation matrix factored under 
some particular condition. The method of equation (13) can then be 
used to determine the effect on the experimentally determined simple 
structure of the experimental conditions (3). 

Of more general usefulness is the case where the battery of tests 
contains, in addition to the n tests whose primary trait composition 
is known, as from a previous analysis, k other tests whose pri- 
mary trait loadings are unknown. The given matrix V will then be 
of order n * r,, the matrix A of order (n+ k) “ r,. Equation (13) 
may be applied to the results of factoring the matrix of intercorrela- 
tions for such a battery, using, of course, only the n-section of A 
in determining A,, and the (n-+ k)-section of A in determining the 
predicted primary trait matrix, V, = A A,. If 7 > 7, the remarks 
made above will apply as before. 

Before the trait-vectors of V, can be known to coincide with the 
trait-vectors of V , the following conditions must be fulfilled: 


1. mn must be greater than 7, , and the matrix V, (n  7r,) must 
be of rank 7, : 
2. the angular inter-relations of the r, traits of V must be 


known ; 
3. each of the r, traits of V must make non-zero contribution to 
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the reduced correlation matrix of the (n + k) tests ; 

4. the factorization of the correlation matrix must be complete 
for the 7, traits ; 

5. wu(d’) must approach zero ; 

6. the angular inter-relations of the traits of V. must be those 


of the traits of V. 


Conditions (1) and (2) are necessary and sufficient to define 
uniquely the axis system of V. Conditions (3) and (4) are necessary 
to the fulfillment of conditions (5) and (6). Conditions (5) and (6) 
are necessary and sufficient that the traits defined by V, coincide with 
the uniquely determined traits of V. The last two conditions, when 
not exactly met, give measures of the degree of approximation. 

Before turning to an example of the application of the method, it 
might be mentioned that equation (13), without normalization of the 
column vectors, applies to the multiple correlation problem of predict- 
ing in turn the several dependent variables represented by the col- 
umns of V from the set of independent variables represented by the 
columns of A. Then: 





Taya =, LCo—O 











Oo est. p = VU mp) /N a Nn (25) 
te = DJ (Vip — Ajp)? 
oS est. Cc — p j 7 (26) 
n 


where o ¢s:. » is the standard error of estimate of the loadings on trait 
p, and o.s:.¢ is the standard error of the trait configuration. 
Examples of the application of the method for the first case, 
where only » tests are involved, are given elsewhere (3). An example 
of the second case, where n tests of known primary trait loadings are 
used to determine the primary trait loadings of (n -+ k) tests, is given 
here. An hypothetical primary trait matrix V , (reproduced in Table 
I), defining an orthogonal simple structure in four dimensions, was 
written down for twenty tests, and the table of inter-correlations 
computed by the matrix multiplication VV’. The resulting correlation 
matrix was factored to four factors by Thurstone’s centroid method, 
estimating the diagonals by the method of highest entry (5), and 
changing signs until the sum of every column, without the diagonal 
entry, was positive. The results of the centroid analysis are given in 
the matrix A , Table II. It was then assumed that the primary trait 
loadings of only the first eight tests were known; equation (13) was 
solved and the transformation A A, made to determine how well the 
method would predict the ‘unknown’ factor loadings of tests 9-20. The 
steps in the solution of equation (13) are presented in detail as an 
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TABLE I e 
Primary Trait Loadings of Twenty Hypothetical Tests 
(Matrix V) 
Trait 

Test p—1 p—2 p—3 p—4 
1 -.700 .000 .000 400 
2 .000 .800 .000 000 
3 000 .400 .600 —.500 
4 300 .000 .000 .800 
5 .000 500 -700 000 
6 000 .000 -700 .000 
“f .600 .000 -000 .600 
8 -.400 -700 300 .000 
9 .400 .800 .000 .000 

10 .000 .000 .000 .900 
11 .000 .400 .600 —.600 
12 -800 .000 .400 .000 
13 .900 .000 .000 .000 
14 .000 .000 -700 400 
15 .600 400 .600 .000 
16 .600 .000 .000 400 
17 .000 .800 -.400 .000 
18 .600 -.400 .000 .400 
19 .600 .000 500 .500 
20 .000 -700 .000 —.600 
TABLE II 
Centroid Factor Loadings of Twenty Tests 
(Matrix A) 

Test p—l p—2 p—3 p—4 
1 58 —.573 —.122 .212 
2 .443 441 —.462 —.183 
3 .410 -710 823 125 
4 385 —.673 —.158 —.357 
5 .633 .449 .270 —.281 
6 062 .166 549 —.172 
7 .513 —.669 —.140 —.007 
8 -785 .310 —.229 .044 
9 .680 .280 —.504 100 

10 .220 —.595 —.131 —.582 
11 373 763 832 198 
12 .688 —.242 261 .449 
13 541 ——.360 —.063 570 
14 454 —.089 .460 —.438 
15 883 123 195 .186 
16 475 —.537 —.110 .145 
17 .639 .540 —.171 —.279 
18 682 —.316 —.344 .056 
19 185 —.485 .293 —.045 
20 216 767 —.319 By & | 
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example of the computational procedure. 


8 
Computing first A’A = 3S @'njQjn> yields 


| 2.2046 
| .1681 
| —.0797 
| —.2641 


j=1 
1681 
2.2527 
4369 
—.0096 


—.0797 
4369 
8040 

—.0239 


Computing the inverse, (A’A)-', yields 


5108 


—.0542 

5019 
—.2796 
—.0487 


0923 
—.2796 
1.4098 
1667 


157 


—.2641 | 

—.0096 

—.0239 
3320 


4110 
—.0487 
1667 
3.3494 


8 
The results of the matrix multiplication A'V = 3 a'nj vj, are: 


_ pl 

1 | 1.1088 

II | —.8804 
III | —.3084 
IV | .0547 


p—2 
1.3844 
1.0783 

—.2657 
—.2061 


p—3 
1.1780 


9495 
6984 
—.2289 


j=1 
p—4 
.6228 
—.5240 
—.4207 
—.2675 





The matrix multiplication (A’A)-“(A’'V) yields A, defined on the 


ranges m and p: 


| p—1 p—2 

I | 607 589 
II | —.418 550 
III | —.077 —.583 
IV | .628 —.218 


-790 
—.212 


—.668 
—.154 





—.636 


When the matrix A is normalized by columns, leading to the matrix 


designated A, , the result 
.625 

—.430 

—.079 

647 


is: 
544 
555 
—.589 
—.220 


522 
.230 
798 
—.213 


.260 
—.690 
—.159 
—.657 
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Multiplying the entire matrix A (order 20 * 4) by Ay, (A Ay = 
> Aim Amp) » 
eiees the predicted primary trait matrix, V, , reproduced in Table III, 
TABLE III 
Primary Trait Matrix Determined from 


Centroid Analysis of Twenty Tests 
(Matrix V, =A A,) 


Test Trait 
p—1 p—2 p—3 p—4 
1 -724 —.004 .003 413 
2 .005 -798 .005 .005 
3 .006 399 .607 —.517 
4 312 .008 —.003 .824 
5 —.001 .496 -708 —.004 
6 .000 .004 .699 .005 
“f 615 —.008 .004 .622 
8 .404 .724 .290 —.002 
9 .409 .800 —.002 —.002 
10 027 —.005 —.002 .871 
11 .007 387 591 —.612 
12 .804 —.013 415 .009 
13 .867 .006 .028 025 
14 .002 .023 .675 394 
15 .604 393 .604 —.008 
16 .630 —.007 .006 .416 
17 .000 .809 382 .004 
18 .626 .086 —.001 .413 
19 .616 —.032 514 509 
20 .010 .670 —.023 —.604 
TABLE IV 
Root Mean Square Discrepancies 
Tests Trait 
p—1 p—2 p—3 p—4_ All Traits 
1-8 011 .010 .006 .014 011 
9-20 .018 .018 015 .014 .016 
1-20* .016 .014 012 .014 .014 
1-20} 012 0138 .010 010 O11 


Comparison of V, with the hypothetical trait matrix V shows a 
very close agreement. Table IV summarizes that comparison, giving 
the root mean square discrepancies for each trait for tests 1-8, tests 
9-20, and tests 1-20. For comparison is given also the r.m.s. discrep- 


* Root mean square discrepancies for all twenty tests, when tests 1-8 are used to determine A. 
f Root mean square discrepancies for all twenty tests, when tests 1-20 are used to determine A. 
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ancy when not eight, but all twenty tests are used to define the matrix 
A. ‘2 
A further condition which must be met (condition 6 pes: is 
that, since the trait vectors A, are by hypothesis orthogonal and the 
columns of A are orthogonal by the method of their determination, 
for A, to transform A into V , A, must also be orthogonal, i.e., A’>A, = I. 
The results of the multiplication A’, A, are here given, and show a 
very close approach to orthogonality. 


TABLE V 
Intercorrelations of Obtained Primary Traits 
(A’, Ay) 
p—l p—2 p—3 p—4 
p—1 1.000 .006 027 .047 
p—2 .006 999 —.009 —.003 
p—3 .027 —.009 1.000 —.009 
p—4 047 —.003 —.009 1.001 


The close agreement obtained warrants the conclusion that the 
method here developed is applicable, not only to those situations where 
it is desired to rotate into a simple structure whose entries are wholly 
known, but also to those situations where the primary trait loadings 
of only part of the tests are known, subject to the conditions discussed 
above. 

As a further test of the method, the following experiment was 
performed, testing the stability of simple structure under changes in 
the test battery. To each of the intercorrelations of the twenty hypo- 
thetical variables defined by Table I was added a chance error com- 
ponent determined so that the distribution of chance error compon- 
ents conformed to the sampling errors to be expected if the twenty 
tests had been given to a sample of 100 cases, and the product moment 
coefficients of correlation computed (8). From the twenty tests two 
batteries of tests were formed: Battery X comprised tests 1-8 and 
tests 9-14; Battery Y was composed of tests 1-8 (as in Battery X) 
and tests 15-20. Correlation matrices for each battery were derived 
from the correlation matrix for the twenty tests, each entry carrying 
a chance error component. Battery X was factored by the centroid 
method, and the section of the V, matrix for tests 1-8 used to deter- 
mine the primary traits by equation (12), solving for f§ for each trait. 
The resulting simple structure was designated V;.o, to indicate that it 
was the V matrix for Battery X, predicted from the hypothetical trait 
matrix V,. Battery Y was then factored by the centroid method, and 
the section of Vx. for tests 1-8 used to determine the primary traits 
as before. The resulting primary trait matrix, Vy) .o, the V-matrix 
for Battery Y predicted from Vy» was then compared with the origi- 
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nal hypothetical matrix, Vo. 
The above experiment duplicates the conditions of administering 


a set of tests of unknown primary trait composition to a group of 
subjects, determining the primary trait configuration, and then using 
a part of that set, together with other tests, to form a new battery 
whose simple structure is to be determined on the basis of the experi- 
mentally obtained structure of the tests common to the two batteries, 
so that the factor loadings of those tests not common to the two 
batteries may be expressed on the same reference axes. 

The resulting fit of Battery Y to the hypothetical trait configura- 
tion is indicated in Table VI, which gives the discrepancies: 


Pip = Ljp — Vip 


where a;, is the primary trait loading of the j-th test for the p-th pri- 
mary trait, from the matrix Vy.xo, and v;, is the hypothetical trait 
loading of that test on that trait, defined by Vo. 


TABLE VI 

Test p—1 p—2 p—3 p—4 
1 —.030 —.088 130 .142 
2 .022 —.021 .016 —.005 
3 —.018 .040 .050 .074 
4 .250 .017 —.031 —.248 
5 .129 —.089 —.052 —.223 
6 —.025 —.013 .039 077 
7 —.060 .029 —.083 .210 
8 —.062 112 —.001 .055 
15 —.165 161 .077 .232 
16 .163 146 —.173 —.223 
17 .260 035 —.026 —.275 
18 .207 —.030 —.003 —.211 
19 .206 —.110 .065 —.111 
20 —.0838 027 .120 —.017 


The root mean square discrepancies are given in Table VII. In 
explanation of the table: The values are listed separately for each 
trait and for the total configuration, by columns; the first row gives 
the root mean square discrepancy between Vy.xo and V, considering 
only the eight tests used in determining the two matrices of transfor- 
mation A,, and A,; the second row gives the value for the tests 15-20, 
those not concerned in determining the rotation; the third row gives 
the value for the battery of 14 tests; the fourth row gives the root mean 
square discrepancy for the same 14 tests when they were included in 
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the battery of twenty, and all twenty tests were used in determining 
the configuration. The last value is thus the root variance due to the 
addition of chance error components to the correlation coefficients, 
but does not indicate the increased error variance due to the reduction 
of the number of tests from 20 to 14. The fit as shown by the dis- 
crepancies themselves, and by their r.m.s. values is quite satisfactory 
for traits p-2 and p-3, but for traits p—1 and p—4, it leaves much to 
be desired. In every instance the r.m.s. discrepancies were greater 
for tests 15-20 than for the tests used in the determination of the 
transformation, but in only one trait, p—1, was the difference striking- 
ly large. 

Comparing the r.m.s. discrepancies for the fourteen tests, deter- 
mined by only eight tests, and using, not the true values of the pri- 
mary trait loadings, but approximations to them, with the similar 
values for the same fourteen tests when these were included in a 
battery of twenty, and the true factor loadings of all twenty tests 
used to determine the transformation matrix shows only one trait 


where the difference is large. 














TABLE VII 
R. M. S. Discrepancies, Vy +, — Vo 
Range 
of Tests p—1 p—2 p—s p—A 
wes 18 1057  .0628 0630 —0UCtC~” 
15-20 .1886 .1018 .0959 1985 
1-8, 
15-20 .1471 .0818 .0792 1745 
1-8, 
15-20 .0637 .0787 .0460 .0494 
from battery 
of twenty 
tests 





From this experiment we may conclude that the application of 
the method here developed to two batteries of tests having more than 
r tests in common in order to determine approximations to the load- 
ings of the tests of the two batteries on the same references axes is 
justified; that the reference axes so determined may be fairly close 
approximations to the underlying primary traits; that the discrep- 
ancies for the tests not common to the two batteries will be only 
slightly larger than for the tests on which the transformation is based. 








162 PSYCHOMETRIKA 


1. 


REFERENCES 


Boécher, Maxime. Introduction to higher algebra. New York: Macmillan, 1912, 
p. 78. 

Eckhart, C., and Young, G. The approximation of one matrix by another of 
lower rank. Psychometrika, 1936, 1, 211-219. 

Mosier, C. I. Influence of chance error on simple structure. Psychometrika, 
1938, 3, 33-44. 

Thomson, G. H. Some points of mathematical technique in the factorial 
analysis of ability. J. Educ. Psychol., 1936, 27, 37-54. 

Thurstone, L. L. Vectors of mind. Chicago: University of Chicago Press, 
1935, pp. xv + 266. 





12, 


al 


s, 





pSYCHOMETRIKA—VOL. 4, NO. 2 
JUNE, 1939 


THE CONTRIBUTION OF AN ORTHOGONAL MULTIPLE 
FACTOR SOLUTION TO MULTIPLE CORRELATION 


P. S. DWYER 
The University of Michigan 


A method is indicated by which multiple factor analysis may be 
used in determining a number, 7, and then in selecting r “predicting” 
variables out of n variables so that each of the remaining n-r vari- 
ables may be predicted almost as well from the 7 variables as it 
could be predicted from all the n—1 variables. 


One of the unsolved questions of multiple factor analysis deals 
with the relation of multiple factor analysis to multiple correlation. 
Roff (1) has previously given some theorems dealing with this sub- 
ject, but his approach emphasized the application of multiple correla- 
tion results in determining communality. The present paper empha- 
sizes the contribution of the factor analysis results to multiple corre- 
lation. 

Suppose that the intercorrelations of variables 1, 2, 3,---,7,-:+,” 
are subjected to a multiple factor analysis which results in 7 common 
factors. The resulting weights are indicated by a;; where the first 
subscript indicates the variable and the second subscript indicates the 
factor. Let the communality of test 7 be indicated by h?;, and the 
uniqueness by w?; (2, p. 63). 

We let the r common factors be represented by 7 orthogonal unit 
reference vectors and prove the theorem, given previously by Roff (1, 
p. 2) with somewhat different emphasis, “The multiple correlation of 
variable 7 with the 7 unit orthogonal reference vectors is equal to the 
square root of the communality of variable 7.” 

We first construct the matrix of the correlations involving vari- 
able j and the r reference vectors. We note first that the correlation of 
reference vector 7 with itself is unity and that the correlation of refer- 
ence vector 7 with any other reference vector is 0. Furthermore, the 
correlation of variable 7 with reference vector 7 is the correlation be- 
between 


Hj = Ujy Yr fF Uo Yo fF 22° Aji Ys o> + Ue Yr + Uy 2; 


and y;, and is 
Aji 


r = == 
HV V0? i, + Wig f +--+ a), Wj 
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= Aji 
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This result was previously indicated by Roff (1, p. 3). In case 
r = 3, the determinant of the correlations becomes 


1 Qj, Ajo Ajs 


| 
| 

ie + | 
Qjz 0 0 1 | 


If now the elements of the second column are multiplied by — a;, , the 
elements of the third column are multiplied by — aj;2, the elements of 
the fourth column by — a;; , and the results added to the first column, 
we get 


Wy Ajy Ajg Ag 





sa 6 @ | 
eo a a 
> ee | 
where u?; = 1 — a?;, — a?j. — aj, . 


The proof for the general case of 7 factors follows in an identical 
manner so that we have 


r 
A = u?;, where u?; = 1— 3 a’; = 1—h’,. 
i=1 


It is at once apparent that the determinant obtained by deleting 
the first row and the first column is 1 so that 4,, = 1. The usual de- 
terminantal formula for multiple correlation (3, p. 301) then gives 


Tjeroe0eer =,/1 _ 4 = Vil —wvj;= h; . 
Ay, 
Another method of establishing this theorem consists in correlat- 
ing 
5 = Uy Ya HP Uj Yo o> ys Yi oe jr Yr + Uy 2; 
with the “common” part of 2; , 
"5 = Oj, Yr + Ajo Yo tes + Qi yi te Tir Urs 
and gives 


ai ? 5, + A? jg +--+ + Oj— 
[aj + aj2+---+4+ a?;, + w?;) (a2;, + a2). +--+ a2;,)]! 
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= Vi FO Fe; = Vij = hy. 

A second theorem is “The multiple correlation of variable 7 with the 
n — 1 remaining variables and the r reference vectors is equal to the 
square root of the communality of variable j.” This theorem was also 
given by Roff (1, p. 2) (with somewhat different emphasis) who im- 
plied that it followed from the first theorem with the statement, “Since 
the correlation between any two tests 7 and k is 0 after common fac- 
tors have been removed, a second theorem follows immediately.” 

A more formal proof is outlined below. 

The matrix of the n variables and the rv reference vectors is com- 
posed of four different types of matrices if the n variables precede the 
r factors both horizontally and vertically. In the upper right hand 
corner there is the factorial matrix, in the lower left hand corner 
there is the transpose of the factorial matrix, while the lower right 
hand corner contains the identity matrix. For example, when n = 8 
and r = 2, we have 





Ayo Age Ase 0 1 


The process used in establishing Theorem II is similar to that used 
in establishing Theorem I. The elements of column 4 are multiplied 
by —a,,, the elements of column 5 are multiplied by —a,,., and the 
results added to the elements of column 1. Similarly the elements of 
column 4 are multiplied by —a., , the elements of column 5 are multi- 
plied by —a.,., and the results are added to column 2. Also the ele- 
ments of column 4 are multiplied by —a,, , the elements of column 5 
are multiplied by —a,., and the results added to column 3. The re- 
sults give a matrix, whose determinant, 4, equals the determinant of 
the original matrix, of the form 


w, 0 O Gy Ay 


0 ws Gs, so | where u?; = 1 —a?;,—a?;,, 
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and it is immediately apparent that the determinant, 4 , of the matrix 
is 
A=, ;. 

By similar reasoning it can be shown that the determinant of the 
intercorrelation coefficients of n + r rows and columns can be reduced 
to a determinant which is composed of four parts. The upper left en- 
tries constitute a diagonal matrix with elements wu’; , the upper right 
entries constitute the factorial matrix, the lower left entries constitute 
the 0 matrix, while the lower right entries constitute an identity ma- 
trix. Hence 

A= u*, u*, --> U,. 

This argument can be reduced to matrix notation. If the funda- 
mental matrix equation is AA’ = R — U? and if we let A(M) denote 
the determinant of the matrix, M , then 

R A|l_ ,{R—AA’ A]_ ,[U? A]_ u 
Al 4 r= 4| 0 T I= 4lo 7 |= 4). 
If A;; is the determinant obtained by deleting variable 7 from the 
variables, then 
Ajj = UP, Ue °* U7 5 W541 00° U,. 


A 
A;; is also the cofactor of 7r;; and . oe U? ;- 
ii 
If we let R; be the multiple correlation of variable 7 with the n—1 
remaining variables and the r factors, then 


R; = va— 4 = y¥ 1—#, = h; ° 
Aj; 

An important corollary follows at once from Theorem II. “The 
multiple correlation of variable 7 with the n—1 remaining variables 
is equal to or less than the square root of the communality of variable 
j.” This, too, was given essentially by Roff (1, p. 4) who empha- 
sized the fact that the multiple correlation might be used to provide 
an estimate of the communality. The emphasis in the present paper 
is on the fact that the square root of the communality, or even an 
approximation to it as indicated by the Thurstone method (2, p. 89), 
may be used as an upper bound for the multiple correlation. (In later 
papers it will be shown how multiple and partial correlations can be 
found from multiple factor results). In the present paper the attempt 
is made to answer the question, “Is it possible to find a small number, 
r, of variables, from which it is possible to predict each of the n—r 
remaining variables almost as well as though each variable were pre- 
dicted from all the n—1 other variables?” In answering this question, 
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the square root of the communality is used as an upper limit against 
which one may measure the approximate loss of predictive power. 

A simple problem involving only four variables has been chosen 
to illustrate the method, as it permits the examination of all possible 
multiple correlation coefficients. The method has also been applied 
with reasonable success to the prediction of the scores on 15 vari- 
ables from the scores of four other variables (4). 

The simple illustration is one previously used by Hotelling (5). 
The variables are, in order: 

(1) memory for words 

(2) memory for numbers 

(3) memory for meaningful symbols 

(4) memory for meaningless symbols. 

In Table I are presented the intercorrelation coefficients which 
Hotelling obtained from T. L. Kelley, rounded to two decimal places. 


TABLE I. Intercorrelations of Four Memory Tests 


Test 1 2 3 4 
1 1.00 .96 mae 4 54 
2 .96 1.00 .86 -70 
3 at 86 1.00 82 
4 54 .70 82 1.00 


With such a small number of variables it is feasible to secure all 
possible multiple correlation coefficients. These are given in Table II. 
The predicting tests are indicated in the first column. 


TABLE II. All Possible Multiple Correlations of the Four Tests 


Average multiple 





Predicting correlation, 
Test Predicted Test not including 
1 } 4 3 4 predicting tests 
1 1.00 .96 oft 04 -76 
One 2 .96 1.00 .86 -70 84 
test 3 77 86 1.00 82 82 
4 54 -70 82 1.00 69 
1,2 1.00 1.00 .88 84 .86 
1,3 1.00 .98 1.00 83 905 
Two 1,4 1.00 98 .90 1.00 94 
tests 2,3 97 1.00 1.00 82 895 
2,4 98 1.00 91 1.00 945 
3,4 19 .86 1.00 1.00 .825 
1,2,3 1.00 1.00 1.00 88 88 
Three /|1,2,4 1.00 1.00 91 1.00 91 
tests 1,3,4 1.00 .98 1.00 1.00 .98 


2,3,4 99 1.00 1.00 1.00 99 
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It appears from the above table that there is a definite increase 
in predictive power as one goes from one-test prediction to two-test 
prediction. There is also considerable variation in the predictive pow- 
er of the different combinations of two tests. The best predicting com- 
bination seems to be the tests 2,4 with the combination 1,4 a close 
second. In the case of three-test prediction, there is a slight improve- 
ment in predictive power. Specifically, the tests 2,4 predict all the 
tests 1,2,3,4 almost as well as do the tests 1,2,4. That is, the inclusion 
of test 1 does not greatly improve the predictive power, except that of 
test 1 itself. The inclusion of test 3 does make some improvement. 

It is possible to determine by exhaustive methods, as in the above 
example, just how many tests and what tests should be chosen. In 
the general case involving a very large number of variables, however, 
such exhaustive methods are out of the question. 

A technique for exhaustive methods has been worked out by 
Frisch (6). As a matter of fact his Confluence Analysis should be 
studied closely by all who work with multiple correlations in which 
the predicting variables are correlated. However, his technique is 
hardly feasible when the number of variables is larger than 12. Ac- 
cording to his estimate (6, p. 97), his “tilling,” which takes only a 
little over four hours for a five-variable problem, would take over 565 
hours for a twelve-variable problem. In such a case it is preferable, 
if the results can be used in obtaining the essential information, to 
utilize a centroid solution, particularly if, as is customary, the num- 
ber of factors is appreciably less than the number of variables. 

The correlation coefficients of Table I were subjected to a cen- 
troid solution. The approximation scheme, which Thurstone recom- 
mends, for placing in the diagonals at each step the absolute value of 
the largest entry in the appropriate row, was followed. The values 
of the communalities resulting are first approximations to the true 
communalities. These approximations are not so good with n and r 
small as with n and *¢ large. 

In a few minutes it was possible to show that, aside from small 
residuals, the intercorrelations could be explained on the basis of two 
factors. The factor loadings, with the corresponding h?; and h; are 
given in Table III. 


TABLE III. The Factor Loadings — Thurstone Approximation Method 


Test I II h2, h, 
1 901 402 973 986 
2 968 233 991 995 
3 924 —,193 891 944 
4 801 —.418 816 908 
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Roff suggested that the multiple correlations might be placed in 
the main diagonal and communalities computed. These are available 
from Table II. The resulting factor weights do not differ much from 
those of Table III. 

In multiple factor theory it is important to isolate primary traits 
and to work out the regression equations giving the primary traits in 
terms of the original variables. The values of specified variables can 
then be obtained by providing regression equations giving the values 
of these variables in terms of the primary traits. It is our purpose to 
attain this end, prediction of actual values, more directly by obtaining 
regression equations giving the values of the n — r variables in terms 
of the values of r of the original variables, hereafter called ‘“funda- 
mental” variables. We would be willing to sacrifice some of the pre- 
dictive power if such direct methods might be made available. It is 
evident that this problem, as stated, demands a least squares solution 
and need not be translated into multiple factor analysis. 

If, however, each of the primary traits of a multiple factor 
analysis coincided with one of the original variables, it would be possi- 
ble to use the values of these fundamental variables as independent 
variables and to compute the values of other variables from them 
without the necessity of considering primary traits as such. Even 
though no variables are coincident with the primary traits, it may 
be possible to find a variable in the vicinity of the primary trait which 
may serve as a fundamental variable. The selection of such a funda- 
mental variable usually results in some small loss of predictive power. 
An approximate estimate of the maximum loss of predictive power 
may be obtained by comparing the multiple correlation resulting from 
these fundamental variables with the square root of the communality 
of the predicted variable. 

If simple structure is present and if at least one of the variables 
is near each of the primary traits, the method is quite certain to give 
satisfactory results. However, the demonstration of simple structure 
and the isolation of primary traits take considerable time. 

It is sometimes possible to make a good selection of the funda- 
mental variables from the centroid weightings themselves. Plot the 
coordinates of the factor loadings and determine from the graph ap- 
proximately which points may serve as vertices so that the hyper- 
planes will come as nearly as possible to passing through the points 
representing the other variables. With such an informal method it 
is possible that different combinations of the original variables might 
have approximately the same predictive power. The justification is 
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not in a formal method of selection, but in the fact that in the re- 
sults the fundamental variables account for the major portion of the 
possible multiple correlation. 

It should be emphasized that the fundamental variables so selected 
are not to be considered as primary in a unique psychological sense. 
They are simply variables which may be substituted for the complete 
set of variables with little loss of predictive power. If unique mean- 
ing is desired, one should follow the method as outlined by Thur- 
stone (2). 

In the illustration above the factor loadings can be used as co- 
ordinates, the points representing the variables (1),(2),(3),(4). 

The question then arises: Which two-point combination of the 
four points should be chosen? We wish to choose two points so that 
the line passing through these points will come as nearly as possible 
to passing through the other two points. Inspection reveals that the 
combinations 1,2 and 3,4 are unsatisfactory. These combinations were 
the least satisfactory combinations as revealed by Table II. Inspec- 
tion also shows that the combinations 1,4 and 2,4 are among the best. 
This result also agrees with Table II. 

As a matter of fact there is little to choose between the 1,4 and 
the 2,4 combinations. Actually the 2,4 combination is slightly better, 
as judged by the standard deviation of errors. Computing the errors, 
we get 


C12, = 011 C2144 = .008 
C324 = .004 Cr, = .009 


so that the line through 2,4 fits the four points slightly better than 
the line through 1,4. This agrees with the result of Table II. 

With the tests selected the problem becomes a straight least 
squares problem. The regression coefficients may be computed from 
the correlation coefficients or from the actual original scores, if avail- 
able. Methods of solving the groups of regression equations which 
are demanded here have been outlined (7), (8). 

The multiple correlation coefficients are then found by the 
formula 


Lj e1Q0007 = VAT i: + BP jot eee +- Bir. 


If there are three factors, the graphing may be done on a sphere 
with the use of augmented coordinates. If there are four or more fac- 
tors, the first three may be plotted in three-dimensional space and the 
values of the loadings on the smaller factors considered in the selec- 
tion of the fundamental] variables. If a formal test of best selection is 
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desired, the distance from each point (representing a variable) to the 
hyperplane may be computed through the tentative fundamental ver- 
tices, and the fundamental vertices may be so chosen that the standard 
deviation of these distances is a minimum. In many cases, however, 
this mathematical work is not demanded, since the justification of the 
selection of the tentative variables is shown by the fact that the multi- 
ple correlation coefficient is nearly as large as the square root of the 
communality. In the above illustration, for example, 


= .978 while h, = .986; 
= 913 while h, = .944. 


SUMMARY 
A method is indicated by which a centroid solution may be used 
in determining 


1. an approximation to the greatest possible multiple cor- 


relation ; 
2. an indication of the number of “fundamental” variables ; 
3. rough methods for indicating tentative selections. 


The limitations of the method are apparent. It (1) is not very 
formal, (2) leaves considerable to the ingenuity of the worker, and (3) 
provides no guarantee that the results are the best that can be ob- 
tained. Its virtue is in the fact that in many cases and with a rela- 
tively small amount of effort, it provides a reasonably satisfactory 
answer to the problem of reducing the number of independent vari- 
ables. It permits the approximate solution of some problems which, 
because of the enormous amount of computation which is demanded 
by a more formal method, would not be attempted. - 
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NOTE ON THE CALCULATION OF THE CHI-SQUARE TEST 
FOR “GOODNESS OF FIT” 


Calculation of the Chi-square test for “goodness of fit” for test- 
ing the normality of distributions is considerably shortened if the for- 
mula is written in a form similar to that used for the calculation of 
Chi-square in contingency tables, as follows: 

Sa Ts 
1 eet Oa a 
"> = Caf 
where f, is the observed frequency, and %f is the percentage of the- 
oretical frequency in each step interval. 


Certain machine shortcuts facilitate the computation. For illus- 
tration we take the numerical example given by Garrett.* 


(1) (2) (8) (4) (5) (6) 
Bottom Deviation of Area from 





0: fo 1% Bottom of Step Mean to %t 

Step from the Mean Step Limit 

in o-units 

114.5 1 1 274 4966 .0034 

109.5 2 4 2.30 4893 .0073 

104.5 4 16 1.89 4706 .0187 
99.5 10 100 1.48 .4306 .0400 
94.5 13 169 1.06 8554 .0752 
89.5 18 324 .65 .2422 1132 
84.5 84 1156 .24 .0948 -1474 
79.5 30 900 —.17 —.0675 | .1623 
74.5 37 1869 —.58 —.2190 .1515 
69.5 27 729 —1.00 —.3413 .1228 
64.5 15 225 —1.41 —.4207 .0794 
59.5 10 100 —1.82 —.4656 .0449 
54.5 2 4 — 2.23 —.4871 .0215 
49.5 2 4 —2.64 —.4959 .0088 
44.5 1 1 .0041 

1.0000 

Mean = 81.59 “ f% 

o= 12.14 = — = 48687.569 

1 %f 

“a = .08287 1 f% 

=f ee NS 
astih w= Tt N= ee 


* Garrett, H. E. Statistics in psychology and education. Second Edition. New 
York: Longmans, Green and Co., 1937. Pp. 119-124. 
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With calculators such as the Monroe and Marchant, the entries 
for Column (4) are easily determined. The mean (81.59) is run into 
the upper windows by the use of the plus bar, or multiplying keys 
with nothing on the keyboard. The reciprocal of the standard devia- 
tion (.08237) is entered in the keyboard, and build-up will yield the 
entries in (4) above the mean. As the bottom of the step appears in 
the upper windows, the corresponding deviation can be read from 
the lower windows. 

For deviations below the mean the procedure varies with the 
style of the machine. For some machines it is necessary only to re- 
turn to the mean and adjust for division instead of multiplication. 
With other machines it is necessary to clear, set the mean in the up- 
per windows in red figures by the use of the subtraction bar, enter the 
reciprocal of the standard deviation in the keyboard again and pro- 
ceed as above. It is not necessary to calculate the deviation of the bot- 
tom of the last step. This method of calculating dev iations in o-units 
is also convenient in computing z-scores. 

Entries in Column (5) are determined as usual from a table of 
the normal curve. 

Entries in Column (6), the percentage of the theoretical fre- 
quency, are obtained by successive subtraction toward the mean. The 
sum of the entries should be unity. 

The next step is to sum the quotients of each entry in Column 
(3) divided by the corresponding entry in Column (6). Dividends 
and divisors are entered so that their decimal positions in the ma- 
chine remain constant. Quotients are then allowed to sum in the upper 


windows, yielding > =, which, substituted in the formula, gives 
5.833. This value is practically the same as Garrett’s 5.827, yielding 
a P of .97. 

This method eliminates the following computations: deviations 
of the bottoms of the steps in score units, theoretical frequencies, the 
differences between observed frequencies and theoretical frequencies, 
the squares of these differences, and the division of the squares by the 
theoretical frequencies. It involves a formula only slightly more com- 
plicated than the usual one and two steps not appearing in the ordi- 
nary procedure: the squaring of the observed frequencies and the di- 
vision of these squares by the percentages of theoretical frequency. 
The saving in time should be at least 50 per cent. 

PHILIP H. DUBOIs, 
The University of New Mexico. 
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Lewin, Kurt. The Conceptual Representation and the Measurement of Psycho- 
logical Forces. Contributions to Psychological Theory, 1938, vol. 1, No. 4, 
Durham, North Carolina: Duke University Press, pp. 247. $2.00. 


Lewin first undertakes to develop a concept of paths within what he terms 
the life space, a term applied to the totality of those things which “exist psycho- 
logically for a given person.” While he formerly attempted to picture the life 
space as a topological one, he feels now, apparently, that the concept of paths, to 
which he wishes to ascribe the attributes of direction and distance, requires a 
new space which he terms hodological (derived from the Greek word for path, 
hodos, but not, as a humorously inclined critic might imagine, to be regarded as 
synonomous with pathological). In any given hodological space there is supposed 
to be one outstanding or distinguished path from the “region” in which the per- 
son finds himself to the region which represents the goal. Its direction is deter- 
mined by the initial step and is described by naming the region in which the path 
starts and the region it first enters. The distance traversed by the path, how- 
ever, is not definitely defined, and is expressly stated not to be solely a matter 
of the number of regions which it crosses. Only in the special case where one 
path is a part of another can judgments of distance (relative) be made with cer- 
tainty, in which case the whole path must of course extend over a greater dis- 
tance than the part path. 

The author proceeds to picture psychological behavior as “locomotion” along 
these paths by a thing, called “the person,” which is composed of several parts, 
including a motor region, or “motoric,’” and a vital “inner-personal” region. One 
has to be careful not to be deceived by the almost constant use of physical terms 
in a non-physical sense. Thus, by locomotion of the person, one should not un- 
derstand necessarily a movement through physical space brought about by the use 
of the legs (though at times it is so described), because it may be regarded simply 
as a case of “cognitive restructuring” of the environment and may, therefore, occur 
in the absence of movement of any bodily organs. The environment is not the en- 
vironment of the body but that of the person, and it includes such “regions” as the 
mental multiplications and divisions which must be carried out to solve an arith- 
metical problem, as well as the problem itself. Lewin’s concept of the person, which 
is of primary importance in his whole system, does not appear to have a valid psy- 
chological basis. The person resembles the thing Titchener fought against’ thirty 
years ago, namely, an immaterial animal dwelling within the material animal. 
As a matter of fact, the person. is often referred to as the animal. Although in 
the present work only the movements of the animal which occur in the “plane of 
reality” are discussed, the author emphasizes that movements in the various 
planes of irreality are equally important. Just what might be meant by a plane 
in hodological space is not altogether clear. 

To account for the locomotion of the person along the highways of hodo- 
logical space, Lewin constructs an elaborate set of dynamic concepts. In general, 
locomotion is considered to be the resultant of psychological forces, and forces 
are linked with needs of the person. Lewin prefers, however, to avoid direct iden- 
tification of needs with forces by bringing in concepts of valence and tension. 
The strength of a force for locomotion depends upon the valence of the goal, i.e., 
the attractiveness or repulsiveness of the goal, but also depends upon the “dis- 
tance” of the person from the goal. Valences, in turn, it is true, depend upon the 
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needs of the person, but also upon certain non-psychological, “alien” factors, such 
as the existence or non-existence of food in the environment. In addition to val- 
ences, “tensions” are also correlated with needs, so that the force tending to move 
the person towards a goal stands in much the same relation to tension as to val- 
ence; and valence and tension tend to vary concomitantly, the one being, to a 
considerable extent, the effect of need on the goal, and the other that of need on 
the person. The reviewer would suggest that some concept is needed which gives 
planes corresponding to successive moments of time. The need for such a time 
dimension arises from the fact that, so far as can be made out, a hodological plane 
can represent only the life space at a particular moment, i.e., one particular 
space; but locomotion requires time and is accompanied by a restructuring of the 
life space. This restructuring seems to mean a new space. Locomotion, there- 
fore, would seem impossible except from one hodological space into another hodo- 
logical space, and could only occur in a time series of hodological planes or spaces. 

Almost the entire last half of the monograph is devoted to describing in 
terms of the above concepts numerous experiments dealing with the obtaining 
of goals, particularly experiments employing mazes or obstruction boxes. These 
experiments are analyzed in detail as regards their effectiveness in affording 
measures of the forces for locomotion towards the goal, when these forces are 
conceived in the manner described above. 

The limits of this review prohibit any detailed critical estimate of this bold 
and elaborate system of mathematical, quasi-mathematical, spatial, dynamic, 
psychological and pseudo-psychological “constructs.’”’ Obviously, both the truth 
and the meaning of many, if not most, of the innumerable assumptions are open 
to doubt. In general, there might be some question concerning the value of using 
precise mathematical and physical terminology in a way which is not precise. 
The value of the system can at best be only heuristic. 


HERBERT WoopDROW, 
University of Illinois. 
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Dunlap, Jack W. Workbook in Statistical Method. New York: Pren- 
tice-Hall, 1939. Pp. ix + 145. 


A REVIEW 


The author has produced a workbook excellently suited for the 
sort of statistics course which he rightly says is needed — “an inte- 
grated course giving some computation, some assumptions and deriva- 
tions, and considerable opportunity for the interpretation of results.” 
He covers, in the course of 145 pages, somewhat more than the ma- 
terial required for the usual first course in statistics. 

Perhaps the most notable characteristic of the workbook is the 
fact that the instruction is tied up to specific instances in which the 
technique under discussion has been used. Frequent reference is made 
to articles in the field of social sciences—in education and psychology 
particularly — reporting investigations involving techniques illus- 
trative of those being presented. Many exercises involve the inspec- 
tion and evaluation of data from statistical studies reported in the 
literature, affording the student the opportunity to compare his inter- 
pretations with those of the author. 

Useful objective tests are inserted at intervals. 


G. FREDERIC KUDER 
The University of Chicago 
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Correction of an Error 
in 
“A THEORY OF LEARNING AND TRANSFER” 


Dr. Kenneth W. Spence has pointed out an error in our article 
which we wish to correct. 

On page 248 of the December, 1938, issue of Psychometrika we 
stated that Spence assumes “that at the beginning of training the rat 
has tendencies to respond directionally (to the right or left) and 
relatively (to the larger or smaller) ..... We rejected Spence’s defini- 
tion and adopted a directional one when we found it possible to predict 
relational transfer on the basis of directional habits. This prediction 
seemed scientifically more interesting than to assume a relative ten- 
dency at the beginning and then deduce relative transfer later.” 

The foregoing statement of Spence’s view is erroneous. He as- 
sumes no relational tendencies. Instead, he assumes that an animal 
possesses four stimulus-response tendencies: to approach the right; to 
approach the left; to approach one of the two stimuli to be discrimin- 
ated; and, to approach the other stimulus. From the assumption of 
these tendencies, none of which are relational, Spence deduces rela- 
tional transfer. 

Spence’s theory differs from ours in that he assumes four stimu- 
lus-response tendencies, whereas we assume only two, namely, tenden- 
cies to go to the right and to the left. Both theories agree in deducing 
relational transfer from assumptions which involve no relational 
tendencies. 

We greatly regret this error and welcome the opportunity to 
correct it. 


HAROLD GULLIKSEN and DAEL L. WOLFLE 
The University of Chicago 


ERRATA 


In the article “The Multiple Factor Theory in 
Terms of Common Elements,” Psychometrika, 4, 
1939, the term “sine” or “sines” should replace 
“cosine” or “cosines” in the following places: page 
67, lines 17, 18, 31, 33, and 36; page 68, lines 3, 
4, and 22. 


GERTRUDE M. Cox 
Iowa State College 
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